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Preface 


There is an element of truth in the old saying that the Euler textbook 
Introductio in Analysin Infinitorum (Lausannae, 1748) was the first great 
calculus textbook, and that all elementary calculus textbooks published 
since that time have been copied from Euler or have been copied from 
books that were copied from Euler. Euler, the greatest mathematician 
of his day and in many respects the greatest mathematician of all time, 
held sway when, except where the geometry of Euclid was involved, it 
was not the fashion to try to base mathematical work upon accurately 
formulated basic concepts. Problems were the important things, and 
meaningful formulations of axioms, postulates, definitions, hypotheses, 
conclusions, and theorems either were not written or played minor roles. 

Through most of the first half of the twentieth century, elementary 
textbooks in our subject taught unexplained but “well motivated” intui- 
tive ideas along with their problems. Enthusiasm for this approach to 
calculus waned when it was realized that students were not nourished by 
stews in which problems, motivations, fuzzy definitions, and fuzzy theo- 
rems all boiled together while something approached something else with- 
out ever quite getting there. About the middle of the twentieth century, 
precise formulations of basic concepts began to occupy minor but increas- 
ingly important roles. 

So far as calculus is concerned, this book attaches primary importance 
to basic concepts. These concepts comprise the solid foundation upon 
which advanced as well as elementary applications of calculus are based. 
Applications, including those that have great historical interest, occupy 
secondary roles. With this shift in our emphasis, we can remove the 
mystery from old mathematics and learn modern mathematics when we 
sometimes spend a day or two studying basic concepts and attaining 
mastery of ideas, language, and notation that are used. The mathe- 
matical counterparts of hydrogen and electrons are important, and we 
study them before trying to construct the mathematical counterparts of 
carbohydrates and television receivers. 

This book contains just 76 sections, of which only a half dozen can be 
omitted without destroying the continuity of the course. In a three- 
term course meeting thrice weekly for fifteen weeks each term, times for 
reviews, tests, and occasional excursions remain when two sections are 
covered each week. 

With few or no exceptions, each section presents each student an oppor- 
tunity to make a thoroughly sound investment of time that will pay divi- 
dends in personal satisfaction, intellectual enlightenment, and scientific 
power. The material of the section is guaranteed to be worthy of study, 
it being stoutly maintained that nobody should study inept material. 
Each student is expected to read the text and problems of each section 
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as carefully as an alert physicist reads an account of a newly developed 
nuclear reaction, and to learn as much as he can. In most cases a reason- 
able investment of time can produce satisfactory understanding of the 
text as well as solutions of several of the problems at the end of the sec- 
tion. Thus average students can make satisfactory progress. In some 
cases it is an almost superhuman task to digest all of the problems and 
remarks at the end of a section before additional mathematics has been 
studied. Thus superior students have ample opportunities to acquire 
large amounts of additional information and skill. 

To a considerable extent, this book is a book about mathematics as 
well as a mathematics textbook that teaches formulas and procedures. 
The historical and philosophical aspects of our subject are not neglected. 
The text, problems, and remarks frequently give students quite unusual 
opportunities and incentives to think and to become genuine authorities 
on developments of ideas, terminologies, notations, and theories. ‘The 
book strives to produce thoughtful articulate and perhaps even some- 
what sophisticated students who will find that their course in calculus 
gives them admirable preparation for intellectual pursuits. It is fre- 
quently said that calculus textbooks contain so little of the spirit and con- 
tent of modern mathematics that they do not enable students to decide 
whether they have the interests and the aptitudes required for life-long 
careers in pure mathematics or in another science in which mathematics 
plays a major role. Hopefully, this book does. 

The first third of the book contains all or nearly all of the information 
about analytic geometry, vectors, and calculus that students normally 
need in their introductory full-year college and university courses in 
physics. One distinguishing feature of the book is the early introduc- 
tion and continued use of vectors in three-dimensional space. These vec- 
tors simplify, clarify, and modernize our mathematics and, at the same 
time, make our course more interesting to teachers and vastly more inter- 
esting and immediately useful to students. Modern meaningful defi- 
nitions and terminologies of the calculus are used, but we retain and 
explain the standard notations so students can be prepared to live in the 
parts of the world outside their own calculus classrooms. 

The logical structure of the book should be explained. We make no 
effort to tell what points, lines, and planes are; we suppose that they 
exist, and use the axioms of the geometry of Euclid. Similarly we make 
no effort to tell what real numbers are; we suppose that the things exist 
and use the axioms that govern operations involving them. The book 
is based upon these axioms. Ifa theorem fails to have enough hypotheses 
to imply the conclusion, it is a blunder. If an assertion or definition is 
meaningless, it is a blunder. If an argument purported to be a proof or 
a derivation has a flaw, itis a blunder. If we pretend to prove a formula 
for something that has not been defined, this is a blunder. Being 
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“rigorous”? means, in mathematics, being free from blunders. “Giving 
all the rigor that a student can appreciate” means avoiding all the blun- 
ders that the student can detect. The author will not say that this book 
is rigorous because minor errors are inevitable and major blunders are 
possible, but he will say that he has tried to be rigorous. Thus students 
can be and should be invited to be critical. Detection of a blunder should 
be a major accomplishment. 

The author will be delighted if students discover everything that is 
bad in the book and everything that is good in mathematics. Teachers 
need not and perhaps should not give as much attention to the theoretical 
aspects of the subject as the text does. Because the text contains many 
of those comments and explanations that teachers are normally called 
upon to supply as answers to questions, teachers are enabled to devote 
more of their attention to problems. Problems and applications are 
important and, particularly when tests and examinations consist almost 
exclusively of problems, major emphasis must be placed upon the prob- 
lems. We will be unfair to our students if we behave like the president 
of a construction company who trains an employee to be an architect 
and then discharges him because he fails to lay bricks properly. 


Ralph Palmer Agnew 
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1 Analytic geometry 


in two dimensions 


1.1 Real numbers Without undertaking an exhaustive exposition of 
the subject, this preliminary section presents fundamental ideas about 
arithmetic, geometry, and algebra that are used throughout the book. 
While much of the material will be familiar, students are expected to read 
everything (including the problems) to assimilate information and points 
of view required for comprehension of later sections of the book. The 
rules of our game must be unmistakably clear at all times. We read 
everything, and we work some of the problems. 

The numbers with which we are most familiar are the positive real 
numbers. These are the numbers, such as $, 1, +/2, 2, r, 416, etcetera, 
that represent weights of material objects, distances between towns, 
etcetera. The negative real numbers are the negatives of these, examples 
being —4, —1, and — +/2. These positive and negative real numbers, 
together with 0, which is neither positive nor negative, constitute the set 
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of real numbers or the real-number system. Except where explicit state- 
ments to the contrary are made, the word number in this book always 
means real number. It is assumed that we are all familiar with the idea 
that numbers can be represented or approximated in decimal form. The 
equality 4 = 0.5 and the approximation 


(1.11) m = 3.14159 26535 89793 


must not frighten us. Searching questions about the possibility of 
“representing” r and other numbers by “infinite decimals” can be post- 
poned. Our decimal system was devised by Hindus and was carried to 
Europe by Arabs in the twelfth century and earlier, but it took a few 
centuries to convince Europeans that they should and could teach the 
system to all of their children. 
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Figure 1.12 


With each number x we associate a point on a line as in Figure 1.12. 
The line is called the real line or the x axis, and the point associated with 
0 (zero) is called the origin O (oh). If x is positive, say 2, the point 
associated with x lies x, say 2, units to the right of the origin. If x is 
negative, say —3, the point lies —x, say 3, units to the left of the origin. 
This correspondence between numbers and points is one to one; that is, 
to each number there corresponds exactly one point and to each point 
there corresponds exactly one number. The number is called the coordi- 
nate of the point. While points and numbers are entities of different 
kinds, we sometimes find convenience in abbreviating our language by 
using “the point x”? to mean “the point having coordinate x.” The part 
of the x axis upon which positive numbers are plotted, or located, is called 
the positive x axis. 

The statement a = b is read “a equals b” or “ais equal to b.” Simi- 
larly, the statement “a = b” is read “ais not equal to b” or “ais different 
from b.” Thus the statements 2 = 2 and 2 = 3 are true. The state- 
ments 2 Æ 2 and 2 = 3 are false. 

When two numbers a and b are so related that the point corresponding 

to a lies to the left of the point corresponding to b 
—1—_ >=: 8s in Figure 1.13, we say that a is less than b and 
Figure 1.13 write a < b. In this case we say also that b is 

greater than a and write b> a. For example, 

2<6, —3 <1, and —4 < —2. This terminology 
agrees with common usage when temperatures are being compared; we 
say that a temperature —3° is less, or lower, than a temperature 1°. The 
inequality —2 < 0 means that —2 is less than 0 and that —2 is nega- 
tive. The inequality 4 > 0 means that 4 is greater than 0 and that 4 is 
positive. The statement that the weight w, measured in pounds, of a 
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horse is greater than 1990 and less than 2010 becomes 
1990 < w < 2010. 


This can be read “1990 is less than w is less than 2010.” In this book, 
Figure 1.12 precedes Figure 1.13 because 1.12 < 1.13, and both precede 
Section 1.2 because 1.13 < 1.2; the decimal system governs the number- 
ing of all items except those appearing in the lists of problems at the ends 
of the sections. The basic idea that the number 1.131 or 1.135 can be 
assigned to an item which appears between items numbered 1.13 and 1.14 
is often used to bring order out of chaos and has hordes of valuable 
applications. While facts of life are being considered and he still has his 
full complement of readers, the author can extend his best wishes to the 
canonical 20 per cent who will not complete this course. Those who 
abandon their studies to work in the design department of a sport shirt 
factory will be rewarded for commencement of their studies if they have 
learned that items in their stocks can be identified by numbers in one 
sequence and that numbers such as 416.35 and 416.351 can be assigned to 
items that should be listed between 416.3 and 416.4. Numbers assigned 
to items in books and factories are akin to numbers assigned to buildings 
beside streets and to doors inside skyscrapers. In the best of circum- 
stances, these numbers are assigned in an informative way, and they are 
noticed and used when occasions arise. Persons who study Section 5.4 
will be well aware that they are just getting started when they reach 5.41, 
that they are about halfway through the text of the section when they 
reach 5.45, and that they have reached the problems at the end of the 
section when they reach 5.49. 

The inequality a S b is read “a is less than or equal to b.” The 
inequalities 4 < 5 and 5 < 5 are both true, but the inequality 6 £ 5 is 
false. 

The absolute value of a number x is denoted by |x|. It is equal to x 
itself if x > 0; it is equal to O if x = 0; and it is equal to —x if x < 0. 
Thus |7| = 7, |0| = 0, and |—4| = 4. For each x we have |x| = 0. 
Moreover, |x| = 0 if and only if x = 0. For each x we have either 
|x| = x or |x| = —x, and since x? and (—x)? are equal it follows that 
|x|? = x, 

With the aid of Figure 1.12, we acquire the idea that the distance 
between the point with coordinate 1 and the point with coordinate 4 is 3. 
The distance between the points with coordinates —3 and 2 is seen to be 
5, that is, 2 — (—3). By considering the different typical cases, we 
reach the conclusion that the distancet between the two points with coordi- 

T It is far from easy to formulate and use enough axioms involving the geometry of Euclid 
and the set of real numbers to prove that the number |b — al is the distance between the 
points having coordinates a and b. To place our ideas upon a rigorous base, we can do 
what is usually done in more advanced mathematics: construct the foundations of ordinary 


geometry and analysis in such a way that the number |b — al is defined to be the distance 
between the two points having coordinates a and b. 
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nates a and bis |b — al, that is, b — a when b 2 aand a — bwhend S a. 
The fundamental fact that the distance from a to b is less than or equal to 
the distance from a to 0 plus the distance from 0 to b is expressed by the 
inequality 


(1.14) la — b| S |a| + Ibl. 
Replacing b by —b in this inequality gives the inequality 
(1.15) la + b| < |a| + |b]. 


Problem 41 at the end of this section shows how this can be proved. 

We learn in the arithmetic and algebra of real numbers that x? = 0 
when x = 0 and that x? > 0 when x Æ 0. If N is a positive number, 
then there,are two values of x for which x? = N; the positive one of these 
numbers igtdenoted by «/ NÑ, and the negative one is denoted by — ~V N. 
Thus 4? = 16, (—4)? = 16, v16 = 4, and — +/16 = —4. Since 
(—4)? = 16 and +/16 = 4, we see that 


(1.16) VUE = V/16 = 4 = |4]. 


This is a special case of the formula +~/x? = |x|, which holds for each real 
number x. In particular, ~/0 = 0. 

There are times when special properties of the number zero must be 
taken into account. The facts that 0 + a =a and O-a = O for each 
number a seem to be thoroughly understood by all arithmeticians, but 
the role of zero in division may require comment here. It is a funda- 
mental fact that we write x = b/a to represent the number x that satisfies 
the equation ax = b, provided there is one and only one number x that 
satisfies the equation. If a = 0 and b = 0, then O is the one and only 
number x that satisfies the equation and therefore 


0 


- a 


= 0 (a Æ 0). 


Thus 0/a = 0 provided a ¥ 0. Ifa = b = 0, then each number satisfies 
the equation and therefore 


0. . 
ois meaningless. 
If a = 0 and b ¥ 0, then no number x satisfies the equation and therefore 
b. . 
ois meaningless 


when b #0. Thus we see that b/a is meaningless when a = 0, whether 
b is O or not; division by zero is taboo. To look at the matter another 
way, we observe that if a Æ 0, then the equation ax = ay implies that 
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x = y; but the equation 0:2 = 0-3 does not imply that 2 = 3. We must 
always be suspicious of results obtained by division unless we know that 
the divisor is not Q. 

In order to pass literacy tests and to converse with our fellow men, it is 
necessary to know that the numbers 


(1.17) L.. —4, —3, —2, —1, 0, 1,2,3,4, -+> 


are called integers. The numbers 1, 2, 3, - - - are the positive integers. 
If m and n are integers and n = 0, the solution of the equation nx = m is 
written in the form m/n and is called a rational (ratio-nal) number. Each 
integer m is a rational number because it is the solution of the equation 
lx = mandism/1. There is no greatest integer, because to each integer 
n there corresponds the greater integer n +1. Likewise, there is no 
least integer, but 1 is the least positive integer. If e (epsilon) and a are 
positive numbers, then there is a positive integer n for which ne > a; this 
is the Archimedes property of numbers. Another basic fact which is 
easier to comprehend than to prove is that if x is a number, then there is 
an integer n for whichn Sx<n-+1. 

As we near the end of this introductory section, we call attention to 
some additional terminology which is more important than beautiful 
and to which we shall slowly become accustomed as we proceed. When 
a < b, the set of points having coordinates x for which a S$ x < bis called 
the closed interval of points (or numbers) from a to b. The points a and b 
are end points of the closed interval, and they belong to (or are points of) 
the closed interval. The set of points (or numbers) for which a < x < b 
is called the open interval from ato b. The points a and b are still called 
end points of the interval, but they do not belong to the open interval. 
In each case, the number b — ais called the length of the interval. Thus 
the length of an interval is the distance between its end points. When 
a < b, the relations 


_at+bd_b-a 
2 — 2 


a+b b—a 


b ann) 


> 0, > 0 

imply that a < (a + b)/2 < b and that the point having the coordinate 
(a + b)/2 lies between and is equidistant from the points having coordi- 
nates a and b. This point is called the mid-point of the interval (open or 
closed) having its end points at aand b. Ifb < a, the above inequalities 
are reversed, but (a + b)/2 is still midway between a and b. 

The following problems promote understanding of statements made by 
use of inequality and absolute-value signs. (For example, the inequality 
148 < x < 152 says that the number x (which might be the weight of a 
man) is greater than 148 and less than 152. This means that x differs 
from 150 by a number with absolute value less than 2 and hence that 
|x — 150| <2. It is just as easy to see that if |x — 150| < 2, then 
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148 < x < 152. This is a special instance illustrating the fact that if 
a and 6 (delta) are numbers for which ô > 0, then the set of numbers x 
for which a — ô < x < a + Sis the same (see Figure 1.18) as the set of 


a—3} a x a+é x 


Figure 1.18 


numbers x for which |x — a| < 6. It is often convenient to allow ô, the 
Greek d, to make us think of a distance. When ô > 0, the set of points 
(or numbers) x for which a — ô < x < a + dor |x — a| < 6 is the set of 
points (or numbers) x having distance from a which is less than the dis- 
tance ô Thissetisaninterval. A complete understanding of the nature 
of the assertion |x — a| < happens tobe particularly important. There 
will be times when we shall use e as well as 6. For example, Problem 42 
of Problems 1.19 will invite attention to matters relating to the simple 
but important fact that if x and e are numbers for which 


(1.181) lx — 2| + |x — 3| < €, 


then e cannot be 0.01. 


Problems 1.19 


Each of the following 40 statements is followed by a question mark, which 
indicates that the statement may be true or may be false. By drawing appro- 
priate figures or otherwise, determine which of the statements are true and which 
are false. The answers (0 for false and 1 for true) are, given at the end of the list 
of statements. 


1 2<5? 2 —2> —3?! 3 7<7?? 
4 7=7?1 5 7572s 6 2< —5? 
7 —2 £ —1?;, 8 |—4| = 4?) 9 |-3| <2? 
10 —5 = 5? 11 |-2|>0? 12 |ab| = |a||b|? 


13 If is the number of pages in this book, then 75 < p < 85? 

14 If C is the circumference of a circle of radius r, then 6r < C < 6.3r? 
(Remember that C = 2rr, where r = 3.14159+.) 

IS Ifx = 5.4, then 29 < x? < 30? 

16 Ifx = 420, then 20 < ~/x < 21? 

17 Ifx = 4,then2 <x<9? 

18 ÖS <x<7,then4<x<í9? 

19 If5<x<7,then5<x<7? 

20 If5<x<7,then5 Sx 37? 

21 lf5S%<7, then5 <x <7? 

22 Ifa <b, then jal < |b|? 

23 If —2 <x <2, then 0 < x? < 4? 

24 If x? < 4, then |x] S 2? 
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25 If |x —5|<2,then3 <x <7? 

26 If |x — 2| < 0.01, then |x? — 4| < 0.0401? 

27 ‘If |x — 3| <1, then |x? — 9| < 5? 

28 Ifl<x«x<2and1<y < 2, then |x — y| <1? 
29 |(3.05)(3.06) — 9| < #? 

30 There is no real number x such that x? = —1? 
31 If0 <x <1, then —1 < —x < 0? 

32 Ifa<x<b, then —b < —x < ~a? 

33 If0<x<landO<y<1l,thnO<e+y<1? 
34 Ti0 <x <1, thenO < 2x <1? 

35 If |x — al S ô, then |x — al < ô? c 

36 If |x — aj < 6, then |x — a| S 5? 1 

37 Ifô= and0 <x S 1, then |x? — $| < ô? 
38 Ifô= and0 Sx < 1, then |x? — $| S ô? 
39 Ifô= and0 <x <1, then |x? — $| < 6? 
40 If5=%and0 <x <1, then |x? — 3| < ô? 
Answers, 0 for false and 1 for true: 


A 


5 10 15 20 25 30 35 40 
11011 01100 11011 11101 00011 10111 11000 10111 


41 We learned while studying arithmetic and algebra that the product of 
either two positive numbers or two negative numbers is positive, while the product 
of a positive number and a negative number is negative. Supposing that x and y 
are nonnegative, use the identity 


(1) (y — x)(y + x) = y? — x? 

to show that x < y if x? S y% Hence show that the inequality 
(2) ja + b| < jal + lèl 

will be true if |a + b|? < (la| + |d|)? or 

(3) (a + b)? S a? + 2ļaļjb| + b. 


Finally, show that (3) is true and hence that (2) is true. 
42 Sketch several figures which lead to the conclusion that if x and € are num- 
bers for which a 


(1) Ix — 2| + |x — 3| < €, 


then e > 1. Remark: Without using figures, we can prove the result by observing 
that if (1) holds, then 


1 = |3 — 2| = |(@ — 2) — (x — 3)| S |x — 2| + |x — 3| < € 


and hence e > 1. 
43 Using the ideas of the preceding problem, prove that if a, b, x, and e are 
numbers for which 
|x — al + |x — b| < e, 
then e > |b — al. 
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44 Let h be positive and let X (lambda) be greater than 1. Observe that 
Figure 1.191 shows the correct positions of six points having the coordinates 


A: O P; B Po P, 
an E a aA 
—h 0 A-1 h 41 +1 

Atl h i=l h A171 h 


Figure 1.191 


shown there when ÀA = 2. Make a new figure which shows where Po, Pı, and P, 
should be when A = 10. Make another new figure which shows where Po, Pi, 
and P, should be when A = 4. 

45 Referring to Figure 1.191 and supposing that h > 0 and A > 1 as before, 
show that Po is the mid-point of the line segment with end points at P; and P}. 

46 When an appropriate time comes, we shall prove that there is a positive 
number, denoted by the symbol 1/2, whose square is 2. Everyone should know 
that »/2 is not rational, and students possessing requisite time and acumen 
should become familiar with a proof. We prove the fact by obtaining a con- 
tradiction of the assumption that +/2 is rational and hence that ~/2 is repre- 
sentable in the form +/2 = m/n, where m and n are positive integers. We use 
the fact that 28 = 22:7 and the more general fact that each positive integer n 
is representable in the form n = 2%s, where g is a nonnegative integer and s is 
one of the odd integers 1, 3, 5, 7, -- -. If we suppose that »/2 = m/n, then 


2 2P°r\2 2 2p x2 
(1) 2 (z 24s 22452 


where p and g are nonnegative integers and r and s are odd integers. In case 
q 2 >, (1) gives 


(2) 21t2q-2p 52 = r2, 


and this is false because the left side is divisible by 2, while the right side, being 
the square of an odd integer, is odd and is not divisible by 2. In case p > q, 
(1) gives 


(3) s2? = 222g 1p? 


and this is false because the right side is divisible by 2 while the left side is not. 
This proves that +/2 is not rational; the assumption that +/2 is rational leads 
to false conclusions. Remark: It is possible to give different proofs of this result 
and of the more general fact that if n is a positive integer which is not one of the 
perfect squares 1, 4, 9, 16, 25, 36,49, - - - , then y/n is irrational. The standard 
proofs depend, in one way or another, upon fundamental facts about factoring 
positive integers. 

47 Persons who make desk calculators do their menial arithmetical chores 
can get very good approximations to square roots by use of an excellent method 
which involves some very interesting arithmetical ideas. When we want to 
approximate the square root of a positive number 4 given in decimal form, we 
put 4 in the form 4 = 10%, where n is an integer and 1 § a < 100, and use 
the fact that V£ = 10° v/a. To obtain good approximations to ya, we start 
with a given first approximation xı for which 1 $ x, S$ 10 and calculate some 
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more elements of a sequence xı, %2, x3, * * * of successively better approxima- 
tions. If we suppose that x, (where n = 1 when we start) is one of these num- 
bers which is different from v'a, we can absorb and prove the idea that +/a 
should lie between x, and @/%n. Do it. We then examine the tentative but 
sensible suggestion that the average of x, and a/x, may be a better approxima- 
tion to V/a. With this motivation, let 


1 
(1) ntl = 5 (+. + 2) 
and prove that 
o ma- View} (mn —2VE42) 
and hence that 


(3) tan — Va = z- (aa — Va)’. 


If we have not already picked up the idea that squares of small numbers are much 
smaller, we can start by observing that (0.2)? = 0.04, (0.04)? = 0.0016, (0.0016)? 
= 0.00000 256, and (0.00000 3)? = 0.00000 00000 09. This leads us to the idea 
that if x, is a good approximation to ya, then xn;; is much better. In fact, 
if one approximation x, is correct to k decimal places, we expect the next approxi- 
mation %n41 to be correct to about 2k decimal places. Jumps from 3 to 6 to 12 
to 24 are quite amazing. Calculations based upon (1) can be made very rapidly. 
When x, and a/x, agree to 10 decimal places and va lies between them, we have 
very solid information. The method has another feature that even professional 
computers like. Mistakes made before the final calculation do not produce an 
incorrect answer, because using an erroneously calculated approximation is 
equivalent to starting off with a different first approximation. There is even a 
possibility that mistakes may be helpful. 
48 Supposing that 0 < a < b, prove that 


— — a)? 
0< 2T? t b — ab < (b — a)’, 
8a 
Hint: Obtain and use the equality 


a+b stt 
> — Vab 4 + Vab a)? 
1 z 4 Sab Ua + b) +4~/ab 


Use the fact that if a quotient has a positive numerator and a positive denomi- 
nator, we obtain a greater quotient when we replace the denominator by a smaller 
positive number. 


Remark: Persons who study science and philosophy can learn that noble but 
basically ineffective efforts have been made to prove that points, lines, planes, 
and numbers really exist in our physical universe, and to tell precisely what these 
things are. It is the opinion of the author that discussions of such matters have 
no place in a calculus textbook. As the preface says, we assume that these 
mathematical things exist (at least as “mathematical models”) and we make the 
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usual assumptions about them. We can hear many different and even contradic- 
tory tales about the world, but we can always be cheered by the fact that our 
assumptions are universally considered to be interesting enough and useful enough 
to be worthy of study. Absorbing these ideas may not keep us young and fair, but 


we need the ideas to be debonair. 


1.2 Slopes and equations of lines When we study trigonometry, we 
learn about the plane rectangular coordinate system shown in Figures 


Figure 1.21 


Figure 1.22 


1.21 and 1.22 and we become familiar with the formulas 


sin 6 = 
(1.23) 


cos § = 


aja Ril 


, 


, 


tan 0 = 


cot 0 = 


) sec § = 


o] csc 0 = 


QIR Rj 
gls RIN 


which define the six basic trigonometric functions.t In each figure, the 
horizontal axis is the x axis, the vertical axis is the y axis, and the inter- 
section O of the two axes is the origin of the coordinate system. Since the 


Figure 1.24 


matter will be of great importance to 
us, we review the standard procedure 
for plotting (or locating) points whose 
coordinates are given. To plot the 
point 4(3,2), the point 4 whose coor- 
dinates are the positive numbers 3 
and 2, we start at the origin and go 3 
units to the right (in the direction of 
the positive x axis) and then go 2 
units up (in the direction of the posi- 
tive y axis) to reach the point 4 of 
Figure 1.24. To plot the point B 


whose coordinates are the negative numbers —4 and —2, we start at the 
origin and go 4 units to the left (in the direction of the negative x axis) 
and then go 2 units down (in the direction of the negative y axis) to reach 
B. Everyone should examine Figure 1.24 to see that the other points are 
correctly plotted. The signs of the coordinates tell us which ways we go, 


f Our rigorous presentation of angles and trigonometric functions will come in Chapter 8. 
Meanwhile we shall very often review and use facts about angles and trigonometric func- 


tions that are learned in trigonometry. 
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and the absolute values of the coordinates tell us how far we go. The 
quadrant (or subset) of the plane consisting of points having nonnegative 
coordinates is called the closed first quadrant. The quadrant (or subset) 
of the plane consisting of points having positive coordinates is called the 
open first quadrant. The Roman numerals of Figure 1.24 show us how the 
quadrants are numbered. 

Figure 1.25 shows a line L which slopes upward to the right. The line 
L does not necessarily pass through the origin, but we suppose that it 


P(x, 
11,91) A 


Figure 1.25 Figure 1.251 


passes through a given point (1,91). The angle @ (theta) lies between 0 
and 2/2 (that is, between 0° and 90°), and tan 9 is called the slope of the 
line and is denoted by the letter m, so that 
y— yı 


(1.252) m= slope = tan 0 = 
x — Xi 


when (x1,yı) and (x,y) are two different points on L. ima 
Figure 1.25 1 shows a line L which slopes downward TIAA 111 
to the right. This time tan 9 is negative, butitis | [VY] J C] 
still called the slope of the line. We must always T AT 
remember that lines which slope upward to the right 
have positive slopes and lines which slope downward {E7 [| 
to the right have negative slopes. For horizontal -ANS 
lines, we have 6 = 0, so tan 0 = O and m = Q; thus, TW TT m= —3— 
horizontal lines have slope zero. For vertical lines, “| [TAN | [ [_| 
we have 6 = 2/2 (or @ =90°), so tan 0 does not -HAAHI 
exist; thus, vertical lines do not have slopes. To TTT NAL LL 
locate a second point on a line which passes through | | N Ym- 
a given point and has slope m, we start at the given Lt | Nm=—3 
point, go 1 unit to the right and then go m units in mae ere T 
the direction of the positive y axis. When m < 0, a 
journey of m units in the direction of the positive y Figure 1.253 
axis is always interpreted to be a journey of |m| 
units in the direction of the negative y axis. Everyone should look at 
Figure 1.253 and think about this. 

Theorem 1.26 A point P(x,y) lies on the line which contains the 
pont Pi(x1,y1) and has slope m if and only if its coordinates satisfy the point- 
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slope equation (or formula) 
(1.27) y — yı = m(x — x). 


Proof of this theorem is very simple. When x = xı, the point P(x,y) 
lies on the line if and only if y = yı and hence if and only if (1.27) holds. 
When x ¥ x1, the point P(x,y) lies on the line if and only if (1.252) holds 
and hence if and only if (1.27) holds. This proves the theorem. Formula 
(1.27) is known as the potnt-slope formula, and it must be permanently 
remembered. 

In accordance with general terminology which we shall introduce in 
Section 1.5, (1.27) is an equation of the line which passes through the point 
(x1,y1) and has slope m. Moreover, the line is the graph of the equation. 
When we are required to obtain an equation of the line which passes 


through the point (4,—+4) and has slope 3, we put xı = $, yı = —4, 
m = 3, and write immediately 
y+ = 3 — §). 


Sometimes, but not always, it is desirable to put this equation in one of 
the forms 


y = 3x — 4, 12x —4y — 7 = Q, 
and we tolerate the custom which allows any one of the three equations 


to be called “the” equation of the line. Conversely, when we are required 
to draw or sketch the graph of the equation 


y +4 = 3 — 4), 

we observe that the equation has the point-slope form with xı = 4, 
yı = —4, m = 3 and then immediately draw the line through the point 
(4,—+) having slope 3. Problems at the end of this section provide 
opportunities for practice in the art of using these ideas quickly, neatly, 
and correctly. 

When, as sometimes happens, we want to find an equation of the line 
which passes through two given points P1(x1,y1) and Pa(x2,y2) for which 
x2 Æ x1, we determine the slope m of the line from the formula 


Sa 


XQ — Ky x1 — X2 


(1.28) m = OI or VOY 


and then use the point-slope formula. For example, the slope of the line 
passing through the points (3,—4) and (—2,1) is —5/5 or —1, and the 
equation of the line through these points is y + 4 = —1(x — 3). 


Problems 1.29 
1 With Figure 1.24 out of sight, plot the points 4(3,2), B(—4,—2), C(4,0), 


D(0,—2), and F(—2,3). If correct results are not obtained, read the explanations 
of the text and try again. 
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2 Plot the points 4(—7,—1), B(—5,0), C(—3,1), D(—1,2), and £(5,5). 
(These points all lie on a line, and the figure should not contradict this fact.) 

3 Plot the points (6,2), (2,6), (—6,2), (—6,—2), (—2,—6), (2,—6), and 
(6,—2). (These points all lie on the circle with center at the origin and radius 
4/40, and the figure should not contradict this fact.) 

4 Three vertices of a rectangle are (4,—1), (—6,—1), and (4,5). Sketch the 
rectangle and find the coordinates of the fourth vertex. 

5 For each of several values of x, plot the point P(x, 2 — x). What can be 
said about the resulting set of points? 


6 Plot the points P,(x1,y1), Po(xa,ye), QOlx1 + xa yi + y2, R(E, 


Jı T ye? t 2) and make an observation about the figure obtained by drawing the 


line segments from these points to each other and to the origin when 


(a) xı = 6, yı = 0, x2 = 0, y: = (b) xi = 2,91 = 5, xo = 6, yo = 3 
(c) xı = —2, yı = —4,42=7,y2=1 (d) xı = —1, yı = 1l, x2: = 1, y2 =0 


Ans.: The figure is a parallelogram together with its diagonals. Remark: Invest- 
ing time in a good problem can produce dividends. Observe and remember that 
the mid-point of the line segment joining (x1,y1) and (x292) is (25%, zty), 
More information about such matters will appear in the next chapter. 

7 Draw the triangle having vertices at the points P,(—3,1), Pa(7,— 1), Ps(1,5). 
For each k = 1, 2, 3, let mz be the slope of the 
side opposite the vertex P}. Work out the 
slopes shown in Figure 1.291 and observe that 
the answers look right. 

8 Show that the equation of the line P,P; 
of the preceding problem is y =x + 4. Find 
the x coordinate of the point on this line for 
which y = 0. Ans.: The answer is —4, and 
inspection of Figure 1.291 shows that this answer 
looks right. 

9 When numerical values are assigned to the coordinates (%1,1) of a point Pı 
and to m, it is possible to plot the point P, to sketch the line L through Pı having 
slope m, and (provided m = 0) to estimate the x coordinate xo of the point 
(x0) on L for which y = 0. It is then possible to find the equation of L and 
determine xo algebraically. Do all this and make the results agree when 


Figure 1.291 


(a) (xuy) = (1,2), m = 1 (b) (xuy) = (1,2), m = —1 
(e) (xy) = (—2,1), m = 1 (d) (xuy) = (—2,1), m = —1 
(e) (x191) = (—2,—3), m = 5 (f) (1,91) = (4,—2), m = 2 
(g) (%1,91) = (—1,4), = b (h) (1,41) = (1,1), m=1 


10 When numerical values are assigned to the coordinates (x1,y1) and (x2,42) 
of two points P, and Pz», it is possible to plot these points, to sketch the line L 
through them, and (except when L is parallel to the x axis) to estimate the x 
coordinate of the noint (9,0) on L for which y = 0. It is then possible to find the 
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slope m of L, to find the equation of L, and to determine xo algebraically. Do all 
this and make the results agree when 


(a) (1,91) = (1,1), (x22) = (3,2) 

(b) (way) = (1,—1), (x392) = (3,1) 

(c) (%1,91) = (—4,—2), (22,92) = (—1,—1) 
(d) (x191) = (0,4), (x292) = (1,2) 

(e) (x191) = (1,2), Gaye) = (2,1) 

(f) (x191) = (—1,1), (x292) = (4,—1) 


11 Plot at least five points P(x,y) whose coordinates satisfy the equation 
y = 2x — 4. The coordinates can be found by giving values such as —1, 0, å, 
1 to x and calculating y. Observe that these points appear to lie on a line L. 
Show that the given equation can be written in the form y — 0 = 2(x — 2) and 
hence that the points must lie on the line L through the point (2,0) which has 
slope 2. Make everything check. 

12 Supposing that a and b are nonzero constants, find the point-slope form 
of the equation of the line L through the two points (4,0) and (0,5), and show that 
this equation can be put in the forms 


bx + ay — ab = 0, -+o=1. 


The second form is the intercept form of the equation of L. Note that it is very 
easy to put y = 0 and see that L intersects (or intercepts) the x axis at the point for 
which x = a. It is equally easy to put x= 0 and see that L intersects (or inter- 
cepts) the y axis at the point for which y = b. 

13 A line intersects the x axis at the point (2,0) and cuts from the first quad- 
rant a triangular region having area 4. Find the equation of the line. Ans.: 
2Ax + ay = 2a. 


14 For each of the cases 


(a) Py = (1,1), Pe = (7,1), Ps = (7,7) 

(b) P, = (2,2), P: = (8,2), P; = (8,8) 

(c) P, = (—3,—1), P: = (2,—7), P; = (4,1) 
(d) Pı = (—4,—2), P: = (1,—8), Ps = (3,0) 
(e) Pi = (—2,—4), P: = (1,—5), P; = (2, —3) 


sketch the triangle having vertices P1, Pa, P3 and the line L containing P, and the 
mid-point of the side opposite P;. Use the figure to obtain an estimate of the 
x coordinate of the point where L intersects the x axis. Then find the equation 
of L and determine the coordinate algebraically. Produce results that have 
reasonable agreement. Remark: There is one respect in which many problems 
in pure and applied mathematics are like this one. Graphs or something else 
give more or less good approximations to answers, but we need equations to get 
correct answers. When equations give answers that seem to be wrong, the whole 
situation must be given close scrutiny. Mistakes in sign are particularly damag- 
ing, and we all make mistakes when we work too rapidly or too thoughtlessly. 
15 A triangle with vertices 4, B, C is placed upon a coordinate system in 
such a way that Æ is at the origin and the mid-point D of the opposite side is 
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the point (4,0) on the positive x axis as in Fig- 
ure 1.292. Supposing that the coordinates of C 
are (h+ a, k), show that the coordinates of B 
must be (A — a, —k). Find equations of the 
lines containing the sides of the triangles. Write 
the equations in the form y= mx + b and check 
the answers by determining whether the coordi- 
nates of the vertices satisfy the equations. O B(h—a, —R) 

16 The vertices P1(«1,y1), Po(%2,y2), Ps(%a,ys) Figure 1.292 
of a triangle are unknown, but it is known that 
the mid-points of the sides P}P2, P:P3, and PsP) are respectively (7, —1), (4,3), and 
(1,1). Find the unknowns and check the results by drawing an appropriate 
figure. 

17 Formulate and solve a more general problem of which Problem 16 is a 
special case. 


C(h+a, k) 


1.3 Lines and linear equations; parallelism and perpendicularity 
When 4, B, and C are constants{ for which 4 and B are not both 0, the 


equation 
(1.31) Ax+By+C=0 


is a linear equation and we must both prove and remember that its graph 
isa line. Incase B # 0, we can put the equation in the point-slope form 


and see that the graph is the line L which passes through the point 
(0, —C/B) and has slope ~4/B. Incase B = 0, we must have 4 ¥ 0, 


and we can put the equation in the form 


C 


% = =—-> 


The graph of this equation is the vertical line consisting of all those points 
(x,y) for which x = —C/4. This proves the 
result. 

The equation 


can be put in the form y — b = m(x — 0), and 
hence it is the equation of the line L which passes 
through the point (0,b) and has slope m. The equation (1.32) is called 


Figure 1.33 


t The hypothesis that 4, B, and C are constants means merely that they are numbers that 
are “given” or “selected” or “fixed”? in some way. There is no implication that other num- 
bers are unstable in the sense that they are moving around. We shall hear more about this 
matter later. 
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the slope-intercept formula. The easiest way to find the slope m of the 


line having the equation 2x — 3y — 4 = Q is to solve the equation for y 
to obtain 


y = Bx — 4 
and see that m = %. 

Let Lı and L, be two lines which are neither horizontal nor vertical and 
let their slopes be mı and m. Figure 1.34 reminds us of the elementary 
fact in plane geometry that Lı and L are parallel if and only if 6, and 6, 
are equal and hence if and only if tan 0. = tan 6; and mz = mı. Thus 
Lı and Lı are parallel if and only if their slopes are equal. 


Le Lı L Ly 
0+ = 
0. 6, 6, 
x x 
Figure 1.34 Figure 1.35 


For perpendicular lines, the story is more complicated. The lines Lı 
and L: are perpendicular if and only if their slopes mı and mz are negative 
reciprocals, that is, mz = —1/m, or mı = —1/m: or mum, = —1. To 
prove this, we observe that Lz and Lı are perpendicular if and only if 
02 = 0, + 27/2 as in Figure 1.35 or 6; = 02 + 4/2 when the roles of Lı 
and L are reversed. In the first case we have 


(1.351) m = tan +5) = — cot, = — -5 = -> 
and the result follows. To get the result in the second case, we merely 
reverse the roles of Lı and Le. 

As in Figure 1.36, let Lı and L: be two 
nonvertical lines and let œ (phi) be the 
angle through which a line must be rotated 
to bring it from coincidence with L, to coin- 
cidence (or parallelism) with Lz. When we 


Fi 1.36 can find the slopes m, and m: of Li and Ls, 
‘eure L we can determine @ from the fact that 
¢ = 1/2 when mm, = —1 and 
— M — Mı 
(1.37) tan œ Tt wm, F mms 


when mmo Æ —1. The latter formula is proved by the formula 


1.371) ta — A _ tan 6. — tan 4, _ Mı- m 
) n $ = tan (02 ~ 61) 1 + tan 0, tan 6. 1 + mm 
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which employs the standard trigonometric for- 
mula for the tangent of the difference of two 
angles. When we are asked to find ¢, we pre- 
sent tan ¢ as the answerf to our problem. 

As an application of some of the above ideas, 
we find the equation of the line Z of Figure Figure 1.38 
1.38. The positive number p is the distance 
from the origin to the point Pı, and L is perpendicular to the line 
OP, at Pı. The coordinates of P, are p cos æ (alpha) and p sin æ. The 
slope of OP; is tan a, or sina/cos a, and the slope of L is the negative 
reciprocal — cos a/sina. Use of the point-slope formula gives the equa- 
tion of L in the form 


. COS @ 
(1.381) y—psna= ——— (x — p cos a). 

Multiplying by sin a and using the identity sin? a + cos? a = 1 puts the 
equation in the more attractive form 


(1.382) (cos a)x + (sin a)y = b. 


The line OP;, being a line perpendicular to L, is called a normal to L, and 
the equation (1.382) is called the normal form of the equation of L because 
it gives information about this normal. It is sometimes thought to be 
worthwhile to know a speedy way to put the equation 4x + By + C = 0 
into normal form. We suppose that 4 and B are not both O and that 
C #0. The trick is to transpose C to obtain 4x + By = —C and then 
divide by one of + V£? + B? to obtain 


A B -C 
1.383 — a — = — _? 

UO) SVE TE ee? 4 eSB 

where the sign is so chosen that the right side is positive. This equation 
has the normal form. The right side is the distance p from the origin to 
the line, and the coefficients of x and y are respectively the numbers cos a 
and sin a which determine the angle a of Figure 1.38. 


Problems 1.39 


1 Each of the following equations is the equation of a line L. In each case, 
find the slope m by finding the coordinates of the points in which the line inter- 
sects the coordinate axes and then finding the slope of the line through those 


t Traditionally, students are required by tests and examinations to findtan¢. Accord- 
ingly, students who hope to pass examinations by learning a few formulas—and those who 
aspire to a substantial understanding of their subject—are well advised to learn the neces- 
Sary ritual. 
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two points. Then find m by putting the equation in the form y = mx +b 
Make the results agree. 


(a) x+y=2 (b) x+y=3 
(c) 2x+y=2 (d) x+ 2y =2 
(e) 2x + 3y—4=0 (f) 2x+3y+4=0 
(g) 2x — 3y- 4=0 (h) 2x — 3y+4=0 
0 î+3=1 (j) F—-3=1 


2 Draw the triangle having vertices at the points P,(—3,1), P2(7,—1), 
P;(1,5), and observe that P:P: seems to be nearly perpendicular to PPs. Find 
the equation of the line through Ps perpendicular to PPs and show that this line 
does contain the point Pz (The figure appears among the problems at the end 
of Section 1.2.) 

3 For the points Pı, Ps, Ps of the preceding problem, find the equation of 
the line through P» parallel to the line P,Ps. Find the coordinates of the point 
in which this new line intersects the y axis. Put this new line into the figure, 
and make everything check. 

4 For each of the following equations find numerical coordinates of three 
points Pı, Ps, Pa whose coordinates satisfy the equation. If you cannot think 
of a better procedure, let x, = 0, xz = 1, x = 2 and calculate yi, ye, ys. Plot 
the three points Pı, Pe, Ps and notice that they seem to lie on a line. Calculate 
the slopes of P,P: and P,P; and observe that they are equal. Observe that there 
is ample opportunity to check all answers. 


(a) y=x+1 (b) y= 2x+ 3 
(c)x+t+y=5 (d4)x+ty+2=0 
(e) 2x —3y+4=0 (A 2x +3y+4=0 


5 Plot the lines having the equations y = 2x and y = 3x and observe that 
the acute angle @ between them seems to be rather small. Find @¢ by finding 
tan ¢, and then construct and examine an appropriate figure to see that your 
answer seems to be correct. 

6 Supposing that & is a nonnegative number, find the acute angle between 
the lines having the equations y = kx and y = (k + 1)x. Check the answer in 
at least one special case. Tell why the angle should be small when £ is large. 

7 Sketch the line Lı which intersects the coordinate axes at the points (0, —4) 
and (5,0), and the line La, which intersects the coordinate axes at the points 
(0,—5) and (6,0). Find the acute angle between the lines. Ans.: tan 0 = $y. 

8 While assembly lines and mass production reduce costs of manufactured 
items, there is an element of sanity in the idea that the total cost y of publishing 
x copies of a book is ax + b. Sketch a graph of the equation y = ax + b and 
discover the significance of the numbers a and b. 

9 Find the equation of the perpendicular bisector of the line segment joining 


the points P:(x1,y1) and Po(x,ye), putting the answer in the form 4x + By = C. 
Ans.: 


2 — 2 2 a2 
(x2 — xi) + (ye — yi)y ar 
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10 Any given rectangle can be placed upon the x, y coordinate system in such 
a way that its vertices are (0,0), (0,a), (2,0), and (b,a). Prove that if the diag- 
onals are perpendicular, then the rectangle is a square. 

11 Sketch a figure showing the triangle having vertices at the points Pi(*1,y1), 
P2(x2,2), and P3(xs,y3). For each k = 1, 2, 3, mark the mid-point Q; of the side 
opposite P, and find the coordinates of Q}. Supposing that the line QoQ, is not 
vertical, calculate its slope and show that it is parallel to the line P,P». 

12 Prove analytically (by calculating slopes) that the mid-points of the sides 
of a convex quadrilateral are vertices of a parallelogram. Remark: Taking ver- 
tices at (1,1), (2,2), (%s)3),(%4,y4) produces “symmetric” formulas. 

13 Show that the lines having the equations 


aix + biy = cy 
AX + bey = Co 


are parallel if and only if abe — abı = 0. If the lines are not parallel, they 
must intersect at a point P(x,y) whose coordinates satisfy both equations. 
Assuming that the lines are not parallel, solve the equations to obtain the formulas 


bet, — bico ıl — a1 
= oO? (a 
aib: _ abı y aibe — abı 


for the coordinates of the point of intersection. Remark: Those who have for- 
gotten how to solve systems of linear equations can recover by noticing that we 
can multiply the first equation by b and the second by —b, and then add the 
results to eliminate y and obtain an equation that can be solved for x. This 
process is known as the process of successive elimination. 

14 Copy Figure 1.292 and then find the equations of the three medians of the 
triangle and show that these medians intersect at the point (24/3, 0). Remark: 
Since the median placed upon the x axis could have been any median of the tri- 
angle, this provides a proof that the three medians of a triangle intersect at the 
point which trisects each of them. 

15 Show that the lines obtained by giving constant values to k in the equation 


2x + 3y +k =0 


are all parallel. Show that the line Z having the equation 
2(x — x) + 3(y — y) = 0 


belongs to this family and contains the point (1,41). 
16 Show that if the lines 4P and BP joining the points 4(1,2) and B(5,—4) 
to P(x,y) are perpendicular, then 


(œ — I) — 5) + +4) — 2) =0. 


Remark: Persons well acquainted with elementary geometry should know that 
P must lie on the circle having the line segment 4B for a diameter. 

17 Put the following equations into normal form and check the results by 
drawing graphs showing the lines having the given equations and the line seg- 
ments through the origin perpendicular to these lines. 
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(a) 2x + 3y = 12 (b) 2x — 3y = 12 
(c) 2x — 3y = —12 (d) 2x + 3y = —12 
(e) y = 2x +3 (f) y =0 


18 Let k>O. For each of several values of w, draw the perpendicular 
bisector of the line segment joining the points (0, 1/4) and (w, —1/4k). Then 
determine the condition which x and y must satisfy if the point P(x,y) lies 


(a) on exactly one of these bisectors, 
(b) on more than one of these bisectors, 
(c) on none of these bisectors. 


Ans.: (a) y = kx?, (b) y < Rx®, (c) y > Rx®. 
19 For what pairs of values of b and c do the two equations 


3x +by+c=0 
cx — 2y + 12 =0 


have the same graph? Partial ans.: There are two pairs which are easily checked 
after they have been found. 

20 The points P:(x1,y1), Po(x2,y2), Ps(%s,ys) are vertices of a triangle. Find 
the coordinates (x,y) of the point Qı where the line through P, perpendicular to 
the line PP; intersects the line PPs. Partial ans.: 


yee GR Per + (ys — yi) (Ys — yaa + 2 — 91) 02—93) Ho, 
(xs — x2)? + (ys — y2)? 


Remark: It is possible to write the answer in different forms. This form enables 
us to check quickly that interchanging the subscripts 2 and 3 does not change 
the value of x. Such checks are often used to guard against clerical errors in 
deriving or copying formulas. 

21 Let the vertices of a triangle be Pi(%1,y1), Po(x2,y2), Ps(xs,ys). For each 
k, let Ly be the line containing P, which is perpendicular to the line containing 
the other two vertices. Recognizing that altitudes are numbers (not line seg- 
ments or lines), we call the lines Zi, L2, Ls the altitudinal lines of the triangle. 
Prove that these altitudinal lines are concurrent. Remark: The conclusion means 
that there is a point Po, called the orthocenter of the triangle, at which the three 
altitudinal lines intersect. When asked to prove the conclusion by synthetic 
methods, we use our ingenuity (or that of some other people) in searches for 
appropriate figures and ideas upon which the proof can be based. When asked 
to prove the conclusion by analytic methods, we can proceed at once to apply a 
powerful method that cannot fail to produce results if we do the chores correctly 
We can find the equations of the three altitudinal lines and use two of the equa- 
tions to find the coordinates of the point of intersection of two of the lines. If 
this point lies on the third line, the conclusion is true. If (for some triangle) 
the point fails to lie on the third line, the conclusion is false. The chores can be 
done in the following way. Considering separately the case in which the line 
P:P}; is neither horizontal nor vertical (so that this line and L, have slopes) and 
the cases in which PPs is horizontal or vertical, we can find that the equation 
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of LZ; is the first of the equations 


(1) (x3 — xx + (ys — yo)y = (x3 — x)xı + (ys — y2)91 
(2) (xı — x3)x + (yı — ya)y = (xı — x3)x2 + (yı — y3)y2 
(3) (xa — xi)x + (ye — yı)y = (x2 — xı)x3 + (y2 — yi) ys. 


It is possible to repeat the process to show that the equations of Lz and Ls are 
(2) and (3). It is, however, more fun to observe that we can convert the deriva- 
tion of the equation of Lı into a derivation of the equation of L: by making a 
“cyclic advance” of the subscripts so that 1 goes to (or is replaced by) 2, 2 goes 
to 3, and 3 goes to 1. The first cyclic advance converts (1) into (2), and another 
cyclic advance converts (2) into (3). The routine way to finish the proof is to 
solve (1) and (2) for x and y and show that these numbers (x,y) satisfy (3) 
However, if we do not want to obtain and preserve the formulas for x and y, we 
can finish the problem by observing that adding the members of (1), (2), and (3) 
gives 0 = 0 and shows that the third equation is satisfied whenever the first two 
are satisfied. For the record, we note that solving (1) and (2) for x and y gives 
the formulas 


yilxg — x2)xı + yalxı — x3)xa + ya(x2 — xı)%3 
(4) x — (y2 — y1) (ys — ye) (91 — ys) 
yi(%3 — x2) + yo(x1 — x3) + y3(%2 — xı) 
xi(ys — y2)yı + x2(y1 — y3)y2 + x3(ye — yi) ys 
— (x2 — x1) (x3 — x2) (x1 — x3) 
xi(ys — y2) + x(yı — ys) + xa(y2 — yı) 


(5) 9y 


for the coordinates of the orthocenter. With the aid of the identity 
(6) (y2 — y1) (ys — y2) (y1 — ys) = yı(y3 — y2) + y(yi — y3) + ya(y2 — y1), 


we can put these formulas in the forms 


yilæilxs — x2) + yi — yal + yæl — x3) + y3 — yi] 
+ yslxs(x2 — x1) + y? — y3] 


7 = 
( ) ” yi(xs - %2) + yalxı — %3) + ya(xe — %1) 
xalyi(ys — yo) + x3 — x3] + xelye(yr — ys) + x3 — xi] 
(8) y= + xslys(yo — y1) + xf — x3] 


xi(ys — ye) + xyi — y3) + x3(y2 — yı) 


and in many other forms which look quite different. Interchanging the x’s and 
y’s in one of these formulas gives the other. Except for sign, the denominators 
are equal to each other and (as we shall see later) to twice the area of the triangle. 
As is to be expected, a cyclic advance of the subscripts does not alter the triangle 
and does not alter the formulas for the coordinates of the orthocenter. 

22 Again let the vertices of a triangle be Py(x1,1), Pe(x2,2), P3(x3,3). Find 
the coordinates of the point (x,y) of intersection of the three perpendicular bisec- 
tors of the sides of the triangle, and write also the result of making a cyclic advance 
(see the preceding problem) of the subscripts appearing in the answer. Remark: 
Elementary plane geometry shows that the point (x,y) is equidistant from the 
three vertices and hence is the center of the circle containing the vertices. The 
circle is called the circumcircle (or circumscribed circle) of the triangle, and its 
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center is called the circumcenter of the triangle. The answer can be put in the 
forms 


y(x — %2)(%3 + x2) + yo(%1 — x8)(%1 + x3) + ya(x%2 — xı) (x2 + x1) 
+ (ye — Y1) (y3 — Y2) (Yı — ys) 
(1) x= 2yi(es — x2) + ye(xı — x3) + alee — x1)] 
xilya — y2) (ya + ya) + x2(y1 — ya) (yı + ya) + xa(y2 — y1) (y2 + y1) 
+ (x2 — %1)(%3 — x2) (xı — x3) 
(2) y= 2[x1(y3 — ya) + x(yı — y3) + x%a(y2 — yı)] 
and 2 2 2 2 2 2 2 2) 4 (x2 + 2 Ly 2y 
3) x _ rites t 93 — xz — yo) + ye(xi + Yi — x3 — 93 Yal%o T Y2 1— yı 
( 2[yi(x%3 — x2) + ya(xı — x3) + ya(x2 — *1)] 
4) y= xalx Hy — x — ya) + li + yi wa — ya) + xalxs + yo — xi — yi) 
) y= 2[x(y3 — y2) + x(yı — y3) + xa(y2 — yı)] 


and in still other forms which look quite different. 


Po (Orthocenter) 


Nine-point circle 


N 
Y C (Cırcumcenter) 
Figure 1.391 


23 The triangle in Figure 1.391 has vertices at Pi(%1,y1), P2(x%292), and 
P3(xa,y3). The mid-points Mı, M2, M3 of the sides of this triangle are the ver- 
tices of the mid-triangle of the given triangle. With or without making use of 
the ideas and results of the preceding problem, find the coordinates of the cir- 
cumcenter of this mid-triangle. Remark: The answer can be put in the form 


yıls — x2) (2x1 + x2 + x3) + yalxı — x3) (2x2 + x3 + xı) 
(1) x = + yalxe — x1) (2x3 + x1 + x2) — (y2 — yı) (y3 — yo) (yı — y3) 
4iyi(x3 — x2) + y2(%1 — x3) + ya(x2 — xı)] 
æ(ys — y2) (2yı + yo + ya) + x2(y1 — ya) (2y2 + y3 + 41) 
(2) y = — E2 — WOY + yi + ya) — (w — 1) (a — wa) le = a) 
4[xi(ys — yo) + x2(y1 — y3) + xa(y2 — 91)] 
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and in other forms which look quite different. As we shall see in the next prob- 
lem, the circumcircle of the mid-triangle of the given triangle PPP; is the famous 
nine-point circle of the given triangle. The coordinates in (1) and (2) are there- 
fore the coordinates of the center of this nine-point circle. The answers to this 
and the two preceding problems are written in such a way that it is very easy to 
see that the center of the nine-point circle is the mid-point of the line segment 
joining the orthocenter and the circumcenter of the given triangle. 

24 Tosee that there are opportunities to use ideas of the preceding problems 
and the rest of this chapter in geometry, we look briefly at a triangle and its 
nine-point circle. Figure 1.391 shows a triangle PiP2P3, the points Qı, Qe, Qs in 
which the altitudinal lines intersect the lines containing the sides of the triangle, 
and the orthocenter Po. The points Mı, M2, Ms are the mid-points of the sides 
of the triangle, and the perpendiculars to the sides at these points intersect at a 
point C, the circumcenter of the given triangle. The points Ri, Re, Rs are the 
mid-points of the line segments P:Po, PoPo, PsPo. The famous nine-point-circle 
theorem says that the nine points Mı, Me, Ms, Qi, Qe, Qs, Ri, Re, Rg all lie on a 
circle. This circle, the nine-point circle, has its center at the mid-point S of the 
line segment joining the orthocenter Po and the circumcenter C. The radius 
of the nine-point circle is half the radius of the circumcircle. When the triangle 
is equilateral, the orthocenter, the circumcenter, the center of the nine-point 
circle, and the centroid (intersection of the medians) all coincide. When the 
triangle is not equilateral, the four points are distinct but are collinear, and the 
line upon which they lie is called the Euler line of the triangle. 

25 In this problem we use results of Problems 21 and 23 to obtain a new 
formula and a proof of the nine-point-circle theorem. We know from Problem 
23 that the mid-points Mı, Me, Ms of the sides of the triangle are on the circle; 
in fact, these three noncollinear points determine the nine-point circle. The 
remaining points R;, Re, Rs, Qi, Q2, Qs, which we must prove to be on the nine- 
point circle, are not necessarily distinct from each other and from Mi, M2, Ms, 
but our proof will not be a “partial proof’? which covers only “general cases.” 
Our proof will be a proof. Use a result of Problem 21 to show that the x coordi- 
nate of the point R, midway between the vertex P, and the orthocenter Pp is 


yilxs — xa) (xı + x1) + yelxı — x3) (xı + x2) + ys(x2 — xı)(xı + x3) 
+ (y2 — y) (ys — y2) (y1 — Y3) 
2[yi(~s — x2) + ye(xı — x3) + ys(x2 — xı)] 


Use this result to show that the x coordinate of the point midway between Rı 
and the mid-point M, of the line segment PPs is the x coordinate of the center 
of the nine-point circle given in Problem 23. Remark: This fact and the associ- 
ated fact involving y coordinates imply that the points Rı and M, are at opposite 
ends of a diameter (line segment, not number) of the nine-point circle. Similar 
proofs (which are attained by cyclic advances of subscripts) show that R, and M2: 
are at opposite ends of a diameter and that Rs and Ms are at opposite ends of a 
diameter. This proves that Ri, Re, Rs, lie on the circle. We recall that Q; is 
the point at which the altitudinal line through Pı intersects the line containing 
the vertices P and P3, that R, is on the altitudinal line, and that M, is on the 
line containing P2 and Ps. In case Q, coincides with M, or Ri, we conclude that 
Qı is on the circle. In the contrary case, the angle RiQ,M; is a right angle. 


x= 
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Since the line segment Rı Mı is a diameter of the circle, this implies that Q; must 
be on the circle. Cyclic advances of subscripts prove that Q: and Q; lie on the 
circle. This completes the proof of the nine-point-circle theorem. 

26 This problem involves the intersection of the medians of the triangle hav- 
ing vertices P1(%1,91), Pe(x2,ye), Ps(%s,ys). For each k = 1, 2, 3, let M, be the 
mid-point of the side opposite Pa. Show that the equation of the line containing 
the median P;M, can be put in the form 


(1) (yo + ys — 2y) (x — x1) — (x2 + x3 — 2xı)(y — yı) = 0. 


Show that if we define * and 9 by the formulas 


(2) patata, patty, 
then 
x2 + x3 — 2K, = %1 + x: + xs — 3x1) = 3(% — xı) 
yet ys — 2y1 = yı + y2 + ys — 3y1 = 3 (F — yı) 


and (1) can be put in the form 
(3) S — y) — x) — (% — x) — y) = 0. 


Show that (3) implies that the point (%,¥) lies on the median P,Mı. Finally, 
show how this work can be modified to prove that the point (#,) lies on the other 
two medians and hence is the point of intersection of the medians. Remark: 
One reason for interest in this matter can be understood when we know enough 
about centroids. The point (#,9), the intersection of the medians, is the centroid 
of the triangular region bounded by the triangle. It is also the centroid of the 
set consisting of the three vertices of the triangle. Moreover, it is the centroid 
of the triangle itself, that is, the set consisting of the sides of the triangle. The 
coordinates of the intersection of the medians were obtained in a tricky way. 


It is possible to put the equation (1) of the median PM, and the equation of the 
median PM in the forms 


(4) (y2 + ys — 2y1)~ — (x2 + x3 — 2x1)y 

= (y2 + ys — 2yı)xı — (x2 + xs — 2xı)yı 
(5) (ys + yı — 2y:)x — (x3 + xı — 2x2)y 

= (ys + yı — 2y:)x2 — (x3 + xı — 2x)y2 
and obtain the coordinates of the intersection of the medians by solving these 
equations for x and y without using trickery. There is, however, no guarantee 
that time invested in a study of (4) and (5) will produce attractive dividends. 
It is easy to obtain ponderous formulas for x and y, but it is not so easy to reduce 
the formulas to the right members of the formulas (2). 


1.4 Distances, circles, and parabolas As we shall see, the distance 
formula 


(1.41) d = V (xa — x1)? + (y2 — yi)? 


gives the distance d between two points P1(x1,y1) and P2(x2,y2) in the 
plane. To prove (1.41), we notice first that if yo = Yı. then the points P: 
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and P: lie on the same horizontal line and the formula reduces to the 
correct formula d = |xe — xıl. If x: = xı, then P, and Pe lie on the same 
vertical line and the formula reduces to the correct formula d = |y2 — yil. 
When P,(x1,1) and Pe(*xe,ye) are two given points for which x: Æ x; and 
ye Æ yi, we can, as in Figures 1.42 and 1.421, let Q(x2,y:) be the point on 


P2(x2, Y2) Po(X2, ¥2) 
d 
\Ya~1| Yo] g 
Q (X2 y1) Q(x J ) 
Piny) [%2] ! ue \X_—~— xy, Pily) 
Figure 1.42 Figure 1.421 


the horizontal line through Pı and on the vertical line through Pa. The 
length of the horizontal line segment P,Q is x: — xı if x2 — xı = O, is 
xı — x2 if xı — x2 Z 0, and is |x — x,| in each case. The length of the 
vertical line segment QP2 is ye — yı if ye — yı 20, is yı — yo if 
yi — ye 2 0, and is |ye — yıļ| in each case. With the understanding that 
the distance d between P, and P» is the length of the line segment P,P», 
we can therefore apply the Pythagoras theorem to the right triangle PQP: 


to obtain 

d? = |x — xl? + lye — yal? 
and hence 

d = (x: — x1)? + (y2 — ys)? 


Since d 2 0, taking square roots gives the 

required formula (1.41). Figure 1.43 
We are all familiar with the fact, illustrated 

in Figure 1.43, that the circle C with center at Po(h,k) and radius a is the 

set of points in the plane whose distances from Po are equal to the radius a. 

From the distance formula, we see that the point P(x,y) lies on this circle 

if and only if 


(1.44) (æ ~ h)? + O — kh)? = a”. 


This is therefore the equation of the circle with center at (h,k) and radius 
a. We must always remember this and the fact that 


(1.45) x? + y? = q? 


is the equation of the circle with center at the origin and radius a. 
The equation of the circle with center at (—2,3) and radius 5 is 


(1.451) (x + 2)? + (y — 3)2 = 25. 
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When the parentheses are removed and the constant terms are collected, 
this equation takes the less informative form 


(1.452) x? + y? + 4x — 6y — 12 = 0. 
This has the form 
(1.453) x +y? + Dx + Ey +F = 0, 


where D, E, and F are constants. It turns out that for some sets of 
values of D, E, and F, (1.453) is the equation of a circle. To try to write 
(1.453) in the standard form (1.44), we begin by writing it in the form 


(1.454) (2+ De+ )+(y?+fy+ )=—-F. 


The next step is to add a constant to the term x? + Dx so that the sum 
will be the square of a quantity of the form (x + Q). What shall we add? 
A good look at the formula 


(x + Q)? = x? + 20x + Q’ 


provides the answer: divide the coefficient of x by 2 and square the result. 


Thus we add D?/4 and E?/4 to both sides of (1.454) to obtain 


2 2 2 2 
(8+ De+F)+(e+ By +5) -2tE— 


or 


2 2 2 2 — 
ao (242) (+E -2t 


We can now see how the graph depends upon the constants D, E, and F. 
In case D? + E? ~ 4F > 0, then (1.46) is the equation of the circle with 


center at (—4D, —4E) and radius 4 y/ D? + E? —4F. In case 
D? + E? — 4F = 0, the equation becomes 


(1.461) (x + 4D)? + O + $B)? = 0. 


This equation is satisfied when and only when x = —$D and y = —4E 
so the graph is the single point (—4.D, —4£). Incase D? + E? — 4F <0, 
there are no pairs of values of x and y for which the equation is satisfied. 
One is tempted to say that the poor equation has no graph, but the graph 
is actually the empty set, that is, the set having no points in it. Thus, 
determination of the graph of the equation 


x? +y? + 6x —7y+8 =0 


is made by completing squares. The process is important and must be 
remembered. 


l Before starting the next paragraph, we look at some algebra and ways 
in which it is printed. The quotient a/bc is called a shilling quotient and 
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o . . . . a . 
is often printed instead of the built-up quotient be Learning to read 


printed mathematics involving shilling quotients is an art that must be 
cultivated, and this is a good opportunity. Since “multiplication takes 
precedence over division” the quotient a/be means a/(bc) and does not 


mean (a/b)c. Thus, for example, 1/2k means 1/(22) or x and does not 


mean (1/2)k or 4k. When the next paragraph is read, the quotients 
should be handwritten in built-up forms so the calculations can be made 
more easily. If troubles appear, the difficulty may be the canonical one 
that arises when a printer converts an author’s 1/22 into 4%. Every- 
thing should be checked. 

We can get experience with the distance formula by starting to learn 
about parabolas. A parabola is, as we shall show in Section 6.2, the set 
of points (in a plane) equidistant from a fixed point F which is called the 
focus and a fixed line L which is called the dzrectrix and which does not 
pass through the focus.{ In order to obtain the equation of a parabola in 
an attractive form, we let 1/22 denote the distance from F to L so that 
1/2k = p and k = 1/29, where p is the distance (length of the “per- 
pendicular”’) from F to L. Then we put the y axis through F perpen- 
dicular to L and put the x axis midway between F and L as in Figure 1.47. 


Figure 1.47 


The parabola is the set of points P(x,y) for which FP = DP. Using the 
distance formula and the fact that y + 1/42 > 0 when FP = DP gives 


| 2 


t The assumption that F is a “fixed” point and L is a “fixed” line means merely that F 
and Lare “given” or “selected” in some way. Thereis noimplication that other points and 
lines are “unfixed” in the sense that they are moving. At one time the parabola was 
defined as the path (or locus) of a point P which moves in such a way that it is always equi- 
distant from F and L. There are reasons why it is better to say that a parabola is a point 
set. Everybody knows that pencil points and numerous other things move, but even if we 
swallow the dubious idea that “mathematical points” can move we still find that the old- 
fashioned definition does not tell how a point P should move to trace the whole parabola and 
not merely a part of the parabola. 
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This holds if and only if 


IX 1\? 
+(-8)-( +8) 


Simplifying this gives the very simple and attractive equation 
(1.472) y = kx, 


This is the equation of the parabola shown in Figure 1.47. 

The point VY on a parabola which lies midway between the focus and 
directrix of the parabola is called the vertex of the parabola. For example, 
when k ¥ 0, the point (%o,yo) is the vertex of the parabola for which the 
point F(xo, yo + 1/42) is the focus and the line L having the equation 
y = yo — 1/4k is the directrix. As Problem 26 invites us to discover, the 
equation of this parabola is 


(1.473) y — Yo = k(x — xo). 
When k # 0, the equation 
(1.474) y = kx? + ax +b 


can be put in the form (1.473) by completing a square and transposing. 
Thus, when & + 0, the graph of (1.474) is a parabola, and we must always 


. _ til 
remember the fact. The distance from the focus to the vertex is F} 


When the positive y axis lies above the origin as it usually does, the focus 
is above the vertex when k > 0 and is below the vertex when k < 0. 

It is possible to proceed in various ways to calculate the distance 
d from a given point P(xoyo) to the line L having the given equation 
Ax + By +C =0. Problem 34 at the end of this section requires that 
the answer be worked out in a specified straightforward way. It is some- 
times convenient to omit the calculations and use the result, which is set 
forth in the following theorem. 

Theorem 1.48 The distance dfrom the point P(xo,y0) to the line L having 
the equation Ax + By + C = 0 is given by the formula 


d — | Axo + Byo + c| 
VA? + BP 


In some applications of this theorem, we use the version obtained by 
deleting the subscripts. 


(1.481) 


Problems 1.49 


I Draw the triangle having vertices at the points 4(2,2), B(—5,—2), and 
C(—2,—4). Calculate the lengths a, b, and c of the three sides BC, CA, and AB 
and show that c? = a? + 5% This implies that the triangle is a right triangle 


1.4 Distances, circles, and parabolas 29 


having a right angle at C, and hence that the lines P(x, 9) 
BC and CA must be perpendicular. Calculate ‘ 
the slopes of these lines and verify the perpen- 
dicularity. Make everything check. 

2 Figure 1.491 illustrates the familiar fact 
that, when Pi(«1,y1) and Po(%2,y2) are two distinct 
points in a plane, the set of points P(x,y) equi- 
distant from P, and Pz is the perpendicular bi- 
sector L of the line segment PiP2, Equate expres- 
sions for the distance PP; and PP, and simplify the result to obtain the equation 
of L in the form 


(x2 — xı) (« — ate) + (y2 — y1) (» — 2t) = 0. 


Then show that this line passes through the mid-point 


P(x, yı) 
(e) 


9 Pa(x2, Yo) 
Figure 1.491 


zits yT) 
P( 2 ° 2 


of the segment P,P, and is perpendicular to the segment. 

3 Sketch a figure showing the triangle having vertices at the three given 
points and then calculate distances to determine whether the triangle is isosceles 
(that is, has two sides of equal length): 


(a) (1,0), (8,2), (3,—-7) (b) (1,4), (6,—1), (7,6) 
(c) (0,4), (a,0), (b,b) (d) (a,a), (—a,—a), (b, —b) 


4 Find the length of the part of the x axis which lies inside the triangle 
having vertices at the points (—3,— 1), (5,1), and (1,5). Use a figure to deter- 
mine whether the answer is reasonable. 

5 Find the point on the x axis equidistant from the two points P;(—2, —1) 
and P2(4,3) in two different ways. First, find the equation of the perpendicu- 
lar bisector of the line segment P,P: and find the point where this bisector inter- 
sects the x axis. Then, with the aid of the distance formula, determine x so 
that the distance from (x,0) to P; is equal to the distance from (x,0) to Pe. 

6 Find the center and radius of the circle having the equation 


(«— we -5)+ y+ — 2) =0. 


Show that the center is the mid-point of the line segment joining the points 
A(1,2) and B(5, —4). 


7 Find the center and radius of the circle having the equation 
(x — x)(x — x2) + (y — 9) — y2) = 0. 
8 Show that the equation of the circle C with center at Po(xo y0) and radius 


a can be put in the form 
` 


(x — xo) (x — xo) + (y — Yo) (y — Yo) = a? 


30 Analytic geometry in two dimensions 


and that the equation of the tangent to C at a point P,(%1,1) on C can be put in 
the form 


(xı — xo) (x — xo) + (yı — yo) (y — yo) = a. 


Hint: Draw a figure and notice that when xı Æ xo we can calculate the slope of 
the line PoP, and use the fact (from plane geometry) that the tangent to C at 
P, is perpendicular to PoP. 

9 Show that the circle passing through the three points 4(0,2), B(2,0), and 
C(4,0) has its center at the point (3,3) and has radius 4/10. Hint: The per- 
pendicular bisectors of the segments 4B and BC are easily found, and their 
intersection is the required center. 

10 A circle passes through the points (0,7) and (0,9) and is tangent to the 
x axis at a point on the negative x axis. Find the radius, center, and equation 
of the circle. 

11 Let 0 < a < b and find the radius r and center (h,k) of the circle which 
passes through the points (0,2) and (0,b) and which is tangent to the x axis ata 
point to the left of the origin. Ans.: 


ate h=-Va, b= St’. 


T= 


12 A circle has a diameter (line segment, not number) on the x axis. The 
circle contains the two points (4,0) and (b,c) for which c #0. Show that b ¥a 
and find the center of the circle. Ans.: 


b? + ce? — a? 
( 2(b = a)” o) 

13 The points 4(—a,0) and B(a,0) are the ends of a diameter (line segment, 
not number) of a circle of radius a having its center at the origin. Write and 
simplify the equation which x and y must satisfy if 4, B, and P(x,y) are vertices 
of a right triangle the side 4B of which is the hypotenuse. 

14 An equilateral triangle has its center at the origin and has one vertex 
at the point (4,0). Find the coordinates of the other vertices and check the 
results by use of the distance formula. 

15 Sketch a figure which shows whether there are values of y for which the 
point (0,y) is equidistant from the points (—4,1) and (7,—2). Then attack the 
problem analytically. Make everything check. 

16 An equilateral triangle in the closed first quadrant has vertices at the 
origin and at (4,0). Find the coordinates of the third vertex and the slopes of 
the sides. 

17 An isosceles triangle is placed upon a coordinate system in such a way 
that its vertices are (—a,0) ,(a,0), and (0,4). Prove analytically that two of the 
medians have equal lengths. 

18 A triangle has vertices at A(—a,0), B(b,0), C(0,c). Prove that if the 
medians drawn from 4 and B have equal lengths, then the triangle is isosceles. 

19 Find the values of the constant b for which the line having the equation 
y = 2x + b intersects the circle having the equation x? + y? = 25. Ans. 


lb] S 4/125. 
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20 A triangle has vertices Pi(%1,y1), Pe(x2,2), 
P3(x3,y3). Prove analytically that 4 times the sum of 
the squares of the lengths of the medians is equal to 
3 times the sum of the squares of the lengths of the 
sides. 

21 Discover for yourself that a part of a parabola 
can be drawn with the aid of the right triangle (or 
rectangle), string, and pin mechanism shown in Figure 
1.492. A string of length ED has one end fastened 
to the triangle at E and has the other end fastened to 
a pin at the focus F. A pencil point at P keeps the Figure 1.492 
string taut, so FP = DP, and traces a part of the 
parabola as the base of the triangle is moved along the directrix. Such con- 
structions are taboo in the classical ruler-and-compass geometry of Euclid, but 
in analytic geometry we can recognize the existence of all kinds of machinery. 

22 Supposing that p > 0, find and simplify the equation of the parabola 
whose focus is at the origin and whose directrix is the line having the equation 


y = —p. Ans: y= 5 (x2 — p?). Remark: If we set k = 1/2p, then. the 


equation takes the form y = k(x? — 1/4k?). The parabolas obtained by taking 
different values of p or k constitute a family of confocal parabolas; concentric 
circles have the same center and confocal parabolas have the same focus. 

23 Supposing that p <0, find and simplify the equation of the parabola 
whose focus is at the origin and whose directrix is the line having the equation 
y= —Ż. 

24 Supposing that p > 0, find and simplify the equation of the parabola 
whose focus is at the origin and whose directrix is the line having the equation 


x = —p. Ans. r= 7 0 — p’). 


25 Find the equation "of the parabola whose focus is the point (12,0) and whose 
directrix is the line having the equation x = —12. Ans.: x = y?/48. 

26 Supposing that k = 0, use the distance formula to obtain the equation 
satisfied by the coordinates (x,y) of points P equidistant from the point F(xo, 
yo + 1/4k) and the line L having the equation y = yo — 1/4k. Outline of solu- 
tion: A point P(x,y) lies on the parabola if and only if FP = DP, where D is the 
point (x, yo — 1/4k). Writing FP and DP in terms of coordinates gives an 
equation which reduces to (1.473). 

27 Supposing that A > 0 and X > 1, find and simplify the equation satisfied 
by the coordinates of the points P(x,y) whose distances from the point 4(—A,0) 
are À times their distances from the point B(h,0). Ans.: 


(e-t) += (G4) 


28 Still supposing that 4 > 0 and A > 1, show that the graph of the answer 
to Problem 27 is a circle having its center at a point Po on the x axis. Find the 
x coordinates of the points P, and Pz where the circle intersects the x axis. Ans.: 
See Figure 1.191, which displays the x coordinates of the points and shows their 
correct positions relative to 4, 0, B and to each other. 
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29 From the first two of the equations 


(1) x+y + ax t+ by +e =0 
(2) x? +y? + ax + by +c. = 0 
(3) (az — ai)x + (b: — diy + (c2 — c1) = 0 


we can obtain the third by equating the left members of (1) and (2) and simpli- 
fying the result. Supposing that the graphs of (1) and (2) are nonconcentric 
circles, show that the graph of (3) is a line perpendicular to the line containing 
the centers of the circles. Show also that if these circles intersect in one or two 
points, then the line contains the point or points of intersection. Remark: The 
line is called the radical axis of the circles, it being named because some people 
want to talk about it. 

30 The points Pi(%1,y1), Po(x2,y2), and Ps(x3,ys) are, in positive or counter- 
clockwise order, the vertices of an equilateral triangle. Find formulas which 
express x3 and ys in terms of the coordinates of Pı and Pe. Solution: While the 
problem can be attacked in other ways, we eliminate difficulties involving order 
relations by observing that if the half-line extending from P, through P2 makes 
the angle @ with the positive x axis, then the half-line extending from P, through 
P; makes the angle 0 + 2/3 with the positive x axis. Let a be the lengths of 
the sides of the equilateral triangle. The definitions of the trigonometric func- 
tions then give 


(1) Xo — xı = acos l, ~y = asin 8 


(2) szanse 


4 OVI 
= z5 4 cos 8 -z 4 sin 6 


(3) ys — yı = z) = V3 acoso +l asint. 


| 

N 
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In obtaining the latter formulas, we use the “addition formulas” 


(4) cos (0 + $) = cos 6 cos ¢ — sin 0 sin d 
(5) sin (0 + $) = sin 0 cos ¢ + cos @ sin d 


and the values of the sine and cosine of 7/3 which can be determined with the 
aid of Figures 1.493. From (2), (3), and (1), we obtain the 


. answers 
\ 1 3 
4 yo © n=nti -a)y 
2 \ 
3 1 
BO  nsnt m-ati oy. 
\ 
>A JN . 
1 Remark: The points in the xy plane for which both coordi- 
Figure 1.493 nates are integers are called lattice points. Our results 


enable us to show very easily that triangles having vertices 
at lattice points cannot be equilateral. To prove this, let an 
equilateral triangle have two of its vertices, say P, and P, at lattice points. 
Then x1, x2, Yı, ye are integers and, since 4/3 is irrational, (6) shows that xs can- 
not be an integer unless yo = y}. If yo = yi, we must have x. Æ x1, and then 
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(7) shows that y cannot be an integer. This shows that if two vertices of an 
equilateral triangle are lattice points, then the third vertex cannot be a lattice 
point. 

31 Let n be a positive integer. Let mı, ma ** °, ma, Xn ¥ °° ny 
1, Ya * ° * » Yna be numbers for which 


(1) mòt mt >- +m = M>0. 
Let 


mıxı + mote s**t Maka 


Q) z= k g = mot maye t i H mae 


For each k = 1, 2, ++: , n, let rą be the distance from (x,y) to (xz,yz) and let 
dą be the distance from (#,9) to (xry). A timid person may be comforted by 
the special case in which n = 4, m, = mz = m; = m, = 1, and the points (x,,yz) 
are the vertices (1,1), (—1,1), (—1,—1), (1,—1) of asquare. Confining attention 
to the special case if this be deemed desirable, prove that 


(3) mri + mag + >- + mra = Mi — z) 4+ (y — 5) 
+ md? + md? +--+ + md’. 


With the aid of this result, let J be a constant and describe the set of points (x,y) 
for which 


(4) mır? + mri t+ + mar = I. 


32 Let Pı, Po, Ps have coordinates (x1,y1), (x22), (%3,y3), respectively. The 
triangle inequality 


(1) W(xs — x1)? + (ys — y1)? S V (x2 — x1)? + (y2 — 91)? 
+ V (x3 — x2)? + (ys — ye)? 


says that the distance from P; to P3 is less than or equal to the sum of the distance 
from P, to Pz and the distance from P» to P}. In more advanced mathematics, 
analytic proofs of (1) and more or less similar inequalities are very important. 
Show that setting 


(2) a, = X%2— Ši, ao = V2 — Yi; bi = x3 — X25 b: = ya — Y2 


puts (1) in the more agreeable form 


(3) V (ar + b)? + (a2 + b)? S Vad + a} + Vbi + b 
By squaring and simplifying, show that (3) holds if 
(4) larbi + azba| S Va? + a3 Vb? + b. 


By squaring and simplifying again, show that (4) holds if 
(5) 0 S (aib: — azb1)?. 
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Finally, tell why (5) must hold. Remark: In order to appreciate the significance 
of this work, we must do a little thinking about “elementary”? mathematics. 
It is sometimes said that a straight line is the shortest distance between two points. 
If this silly collection of words means anything it means that the length (a num- 
ber) of the line segment (a point set) joining two points P; and Pe is less than the 
length (a number) of each other path (a point set) joining Pı and Pz We must 
study more mathematics before we can learn what we mean by a path joining 
Pı and P: and what we mean by the length of such a path. In some parts of 
“advanced? mathematics, the multifarious axioms of Euclid and the theorem 
of Pythagoras are bypassed and the number d in the formula 


d = V (x2 — x1)? + (y2 — 41)? 


is defined to be the distance (in Euclid space of two dimensions) between the 
two points P;(%1,y1) and Po(x2,y2). It is useful as well as possible to define d by 
other formulas to obtain spaces that are not Euclid spaces. In such situations 
it is necessary to use analytical methods instead of geometrical methods to 
determine whether triangle inequalities hold. 

33 Four numbers 431, 412 421, and az determine the equations 


x’ = aux +a 
(1) , 11 12 


y = an% + doy 


into which we can substitute the coordinates of a given point (x,y) to obtain the 
coordinates of a transform, or transformed point, (x’,y’). Supposing that (%1,y1) 
and (x292) are two given points and that D is the distance between their trans- 
forms (x3, y1) and (x5, y2), find D2. Ans.: 


(2) D? = (a3, + a31) (x2 — xı)? + (aiz + a22) (ye — yı)? 
+ 2(a11412 + azaz) (x2 — xı)(y2 — y1). 


Remark: The transformer is called isometric if the distance d between two points 
is always the same as the distance D between their transforms. If the trans- 
former is isometric, we can put x: — xı = l and y2 — yı = 0 to obtain 


(3) ay, + ah = 1, 

we can put x: — x; = 0 and ye — yı = 1 to obtain 

(4) aia + az. = 1, 

and we can put xz: — xı = l and ye — yı = 1 and use (3) and (4) to obtain 

(5) 411412 + azaz = Q. 

On the other hand, if (3), (4), and (5) hold, then (2) shows that the transformer 


is isometric. 
34 Supposing that the first of the two equations 


(1) Ax + By = —C, Bx — Ay = Bx — Ayo 


is the equation of a given line L and that Po(xo,yo) is a given point, find the 


1.5 Equations, statements, and graphs 35 


equation of the line through Po perpendicular to L and show that it is equivalent 
to the second of the two equations. Solve these equations to find that the 
coordinates x1, yı of the foot P, of the perpendicular from Po to L are 


_ B?xo — AByy — AC Ayo — ABx, — BC 
(2) “=a ON pere 
Show that 
(3) x1 — xo (Axo + Byo + C), 


=a ep 


yı — Yo = a (4x + Byo + C). 


rF 


Finally, use the fact that the distance d from Py to L is the distance from Po to 
P, to obtain the formula 
(4) — |Axo + Byo + c| 

V44 + B 


1.5 Equations, statements, and graphs The equation y = x + 2 
can be regarded as a statement that is true for some pairs of values of x 
and y, for example, x = 3, y = 5, and is false for some other pairs of 
values of x and y, for example, x = 7, y = 7. A similar remark applies 
to each of the equations x? + y? = 4, Ox + Oy = 1, and Ox + Oy = 0, 
and to each of the inequalities 0 < x < 1, y < x, and x2 +y? <1. 
Each is a statement that is true for some (or none or all) pairs of values 
of x and y and is false for the remaining ones. The graph of such a state- 
ment is the set or collection of points P(x,y) whose coordinates are pairs 
of values of x and y for which the statement is true. For example, the 
graph of the statement (or equation) y = x is a line L. We can always 
know that there is a substantial difference between an equation (or state- 
ment) and its graph (a point set). Hence, we may be carrying abbrevi- 
ation of language a bit too far when we sometimes follow the old and mis- 
leading custom of referring to “the line y = x” instead of to “the line 
having the equation y = x.” In any case, we should think about this 
matter enough to know that we are introducing analytic geometry and 
hopefully trying to make sense out of nonsense if we receive a mysterious 
order to “find the part of y = x in x? + y? = 1” and proceed to find the 
length of the part of the line having the equation y = x which lies inside 
the circle having the equation x? + y? = 1.f 

Most of the graphs that appear in our work are graphs of equations. 
However, graphs of inequalities can be important, and we look at some 
simple examples. The graph of the inequality xy > 0 consists of those 
points P(x,y) in the first quadrant (where x and y are both positive) 


t Persons who start picking up clear ideas about these things may even enjoy studying 
statements and sets in mathematical logic and elsewhere. 
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together with those in the third quadrant (where x and y are both nega- 
tive); see Figure 1.51. The graph of the inequality y < x consists of 
those points P(x,y) which lie on and below the line y = x of Figure 1.52. 


x<0, y<0 | x>0, y<0 


Figure 1.51 


J 


Figure 1.53 Figure 1.54 


The graph of the inequality x? + y? < 1 consists of the points inside the 
circle with center at the origin and unit radius. This set of points is 
often called the unit disk; see Figure 1.53. The graph of the inequality 


Figure 1.55 Figure 1.56 


1 < x? + y? < 4 is the set of points in the annulus or ring between two 
circles; see Figure 1.54. 

The equation y = x? is, as we saw in Section 1.4, the equation of a 
parabola. After plotting the points whose coordinates appear in the table 
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x O | +z | +3] +2 | +1 | 44) 42| 4#] 3 
y | O | el] 23l wt] 1|] 2] 4| 2) 9 


we are easily led to the correct conclusion that the graph of y = x? is the 
curve shown in Figure 1.55. It should be noted that the graph contains 
no point (x,y) for which y < 0; if y < 0, there is no x for which y = x’. 
The y axis is an axis of symmetry of the graph, because if (x,y) is a point 
on the graph, then the point (— x,y) is also on the graph. 

The graph of the equivalent equations 


(1.561) xy = 1, y= . 
is more complex. As we shall see later, the graph is a rectangular 
hyperbola. It is easy to add more items to the table 


10 


y 


and to sketch the part of the graph to the right of the y axis in Figure 1.56. 
A similar table in which x and y are both negative enables us to sketch 
the part lying to the left of the y axis. The graph contains no point (x,y) 
for which x = O or y = 0. The x and y axes are not axes of symmetry, 
but the origin is a center of symmetry, because if (x,y) is a point on the 
graph, then the point (—x,—y) is also on the graph. 

The symbol [x] represents, when we are properly warned, the greatest 
integer in x, that is, the greatest integer n for which n < x. Thus 
[1.99] = 1, [3.14] = 3, [0.25] = 0, [—0.25] = —1, [-3.01] = —4, and 
[2] = 2. Itis not difficult to show that the graphs of y = [x] and of the 
saw-tooth function y = x — [x] — $ have the forms shown in Figures 
1.57 and 1.571. 

Trigonometric functions will appear very often in our work, and there 
will be very many times when we must know the natures of the graphs of 


Figure 1.57 Figure 1.571 
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Figure 1.58 


y = sin x and y = cos x. The graphs are shown in Figure 1.58. We 
must always know that, except at the points of tangency, the graphs lie 
between the lines having the equations y = —l andy = 1. Moreover, 
r is a little bit greater than 3, and this must be fully recognized when the 
graphs are sketched. When we want to sketch the graphs, the first step 
is to draw guide lines one unit above and one unit below the x axis. The 
next step is to hop three units and a bit more to the right of the origin to 
mark r, and make another such hop to mark 27. We must be able to do 
this and sketch reasonably accurate graphs of y = sin x and y = cos x 
in a few seconds, and we must be able to look at the graphs and see 
answers to trigonometric questions just as we look at dogs and see answers 
to questions about canine structure. We cannot tolerate doubts about 
the assertions sin 0 = 0, cos 0 = 1, sin 7/2 = 1, cos 7/2 = 0, and dogs 
have two ears. ‘The table on the back cover of this book can be used to 
produce very accurate graphs, but this is seldom necessary. 

Finally, we are never too young to be informed that substantial parts 
of scientific lives are devoted to learning about and using equations akin 
toy = eand y = logx. Graphs of these equations are shown in Figures 
1.581 and 1.582. The exponentials and logarithms have base ¢, and e is a 
number that we shall encounter very often. Here again the tables on the 
back cover of this book can be used. While we should have basic informa- 
tion about graphs before we start our study of functions, limits, and the 


Figure 1.581 Figure 1.582 
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calculus, most of our work with equations and graphs will be done with 
the aid of the calculus. 


Problems 1.59 
Sketch graphs of the following equations and inequalities: 
1 y=x 2 x=0 3 y=0 
4 y= (x-1) 5 y = (x + 1)? 6 y= x3 
7 y=l +x 8 y=l +x 9 xy = —l 

1 1 1 
10 y=TF; H 9= aye 2 y=Tfæ 

1 1 

13 y= ž 14 yauts 15 yur 
16 y = |x| 17 y= |æ — 2] 18 y = 3(x + Il) 
19 0<x<1 20 0O<y<1 21 O0<x+y<1 
22 |x| <1 23 |x —2) <4 24 $<%< 3 
25 y < x? 26 |y| < |x| 27 |y| < x? 
28 |x| +|y| =1 29 |x| + Iyl <1 30 |x| +i >1 


31 With Figure 1.58 out of sight, sketch graphs of y = sin x and y = cos x. 
If unsuccessful, glance at Figure 1.58 and try again. 

32 Figure 1.591, which features half of an equilateral triangle each side of 
which has length 2, shows that 


LT 1l T Lrt V3 m id A \ 
sing = 5) cos 677%” sin 3 =F” cos 3 =F ; N 
\ 
Cultivate the ability to sketch this figure quickly. Use the v3 \ 
information obtained from it to locate points on the graphs / \ 
of y =sinx and y = cosx. Sketch a right triangle in i... 
which each leg has unit length and obtain more points on Figure 1.591 


the graphs. Finally, sketch graphs of y = sinx and 
y = cos y again. Remark: We need familiarity with our graphs, and we need 
confidence in them. 

33 Sketch graphs of 


(a) y = 3 sin x (b) y = sin 2x (c) y = sin (+ + 5) 


Remark: Graphs of equations of the form y = E sin (wx + a) are called sinusoids, 
and we hear very often that E is the amplitude, w is the angular frequency, and 
ais the phase angle of the sinusoid. 

34 Where are the points (x,y) for which 0 S x S 2r and sin x S y S cos x? 

35 Supposing that k ¥ 0, find the slope m of the secant line (or chord) 
containing the two points of the graph of the equation y = x? having x coordi- 
nates xı and xı +h. Ans.: 2x1 + h. 

36 It is sometimes quite important to have correct information about the 
graphs of y = xè? andy = x%, Sketch the graphs over the interval —2 S x S 2. 


40 Analytic geometry in two dimensions 


37 With the aid of the quadratic formula, show that the point (x,y) lies on 
the graph of the equation x? + xy + y? = 3 if and only if —2 Sx S 2 and y 
is one of the two numbers 


—x — V3(4 — x?) —x + /3(4 — x?) 
rr ny 
which are equal only when x = —2 and when x = 2, 
Formulate and prove an analogous statement in which 
the roles of x and y are interchanged. Find the coordi- 
nates of the eight points in which the graph intersects the 
; lines having the equations x = —2, x = 2, y = —2, 
Figure 1.592 y = 2, y =x, and y = —x. Remark: The graph is an 
oval which is shown in Figure 1.592 and which is, as Chapter 6 will show us, an 

ellipse. 

38 Sketch a graph of y = sin x over the interval 0 S x S 2r and then, with 
the aid of simple arithmetic facts like 0? = 0, (0.4)? = 0.16, (0.8)? = 0.64, use 
the result to obtain a graph of y = sin? x. 


39 Sketch graphs of y = cos 2x and y = (1 — cos 2%)/2. Remark: Because 
of the trigonometric identity 


«5 1 — cos 2x 
sin? x = — > 
the answers to this and the preceding problem are the same. 


40 Perhaps the classic guns-and-butter interpretation of the formula 
x+ty=M 


should not be overlooked. It is supposed that a chief has control of M man- 
hours of humanenergy. The chief may preempt x man-hours to provide pressure 
and power to keep his subjects in line and to preserve or extend his authority. 
Then, even when x = M and y = 0, there remain y man-hours part of which 
may be used for production of food, shelter, education, and sundries. Sketch 
a graph which shows how x and y are related. Hint: Do not ignore the basic 
idea that x 2 0 and y 2 0. 

41 Sketch graphs of the three equations Vy = Vx, y =x, y? = x? and 
make some relevant comments. 

42 Leta>0O. Show that the equation 


(1) Vi+ Vy = Va 


holds if and only if0 S x S a and 


(2) y = (V'a ~ Vx)? = a + x — 2 Vax. 

Without making onerous calculations, sketch rough graphs of (1), (2), 
(3) =a +x + 2vVax, 

and 

(4) (y — a — x)? = 4ax. 


Remark: Chapter 6 will reveal the fact that the graph of (4) is a parabola. The 
graphs of (1), (2), and (3) are parts (subsets) of the parabola. 
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43 Let us suppose that a man who marries should select for his wife a woman 
whose age is 10 years more than half his age. Construct a graph for use of 
bachelors who are accustomed to picking information from graphs in the Wall 


Street Journal and everywhere else but are unaccustomed to making abstruse 
mathematical calculations. 


44 Let time ¢ be measured in seconds so that, as we can see by replacing x 
by żin Figure 1.58, sin ¢ increases from 0 to 1 and decreases back to Oin r (about 
3) seconds. If you can acquire the ability to move your pencil point in the xy 
plane in such a way that its coordinates (x,y) at time ¢ are x = sin ¢ and y = 
Isin ż|, you will get a V for victory. 


1.6 Introduction to velocity and acceleration Teachers of mathe- 
matics and physics are accustomed to difficulties involved in correlating 
studies of graphs, vectors, velocities, and accelerations in mathematics to 
studies of diagrams, forces, velocities, and accelerations in physics. 
There is a reason why it is not easy to achieve complete correlation. In 
order to be able to solve just one of his easiest problems involving motion 
of a body or particle, a physics student requires a little information about 
several basic concepts. This section is introduced at the end of our first 
chapter because it may be a desirable or even necessary part of some 
educational programs. Students can be advised to read it to obtain 
preliminary ideas about their external world but, so far as this course is 
concerned, can be advised to postpone the learning of the mathematics in 
it. Some and perhaps most teachers will proceed directly to the next 
chapter and will devote a classroom hour to this section only if and when 
their students face the prospect of studying falling bodies in their physics 
courses before they encounter derivatives and integrals in their mathe- 
matics courses. ‘The next chapter, Chapter 2, treats vectors in space of 
three as well as fewer dimensions. While physicists can regret that this 
delays our full treatment of velocities and accelerations, they can also 
rejoice in the fact that the delay permits production of a much more use- 
ful treatment of the matter.t 

As the preface states, the first third of this book contains all or nearly 
all of the analytic geometry and calculus that students normally encounter 
in their introductory full-year college and university courses in physics. 
In a few weeks, formulas like 


(1.611) s= T + oot + so 

(1.612) o=% = g+ 
dy ds 

(1.613) a222i 


t This is a very conservative statement. Vectors, like numbers, are important things 
and there are many reasons why they should be encountered early and frequently when 
geometry and calculus are studied. 
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will be completely familiar and meaningful to us. Meanwhile, we makea 
preliminary study of ways in which they are related to experiments 
involving falling bodies. Chapters 3 and 4 will give much less informa- 
tion about the physics experiments but much more information about 
the mathematics. 

Suppose we have, as in Figure 1.62, a vertical s axis with 
the positive s axis below the origin. For laboratory experi- 
o ments, we can place one meter stick above another and 

i place minus signs in front of the numbers on the upper 

2 stick. We can suppose that a body is, at time ¢ = 0, 

3 falling or just being dropped so that it travels past the 

s markings on our meter sticks with increasing rapidity as 

Figure 1.62 time passes. On the other hand, we can suppose that the 

body is rising at time ż = 0 so that it rises for a while 

before it begins its descent. We may suppose that distances are meas- 

ured in centimeters, so that s = 20 when the body is 20 centimeters 

below the origin, and that ż is measured in seconds, so that £ = 0.5 when a 
timing device shows a half-second after our time origin or zero-hour. 

Anyone who tosses a body upward and observes the ensuing motion 
must realize that it is not an easy matter to use an ordinary clock to 
obtain accurate data giving the coordinate s of the body at various times 
t. While solid information about such matters must be obtained from 
physicists, we can all recognize the possibility of getting useful data with 
the aid of apparatus so arranged that at each of the times ¢ = 0,7 = 0.01, 
t = 0.02, : = 0.03, - - - an electric spark jumps from a pointer on the 
falling body to burn a tiny hole in a long strip of 
paper attached to the meter sticks. Whenenough 
reasonably accurate information has been obtained 
in one way or another, we can use it to plot points 
(t,s) in a ts plane and obtain a graph more or less 
like that shown in Figure 1.63. For each ¢ within 
the domain for which measurements are made, 
the s coordinate of the point P(t,s) on our graph is 
Figure 1.63 a more or less good approximation to the coordi- 

nate or displacement of the body at time t. 

Some information should be in hand when we undertake to use our data 
and graph to obtain information about our falling body. Without pre- 
tending to have precise ideas yet, we can start with the rough idea that 
forces and velocities and accelerations exist and that these things are 
vectors or are represented by vectors. The reason our falling body 
plummets toward the center of the earth, with speed increasing when it is 
headed downward, is that the earth exerts a gravitational force upon it. 
The magnitude of this force is the weight of the body. Since we find no 
perceptible change in the weight of a body when we raise or lower it a few 
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meters, we conclude that, so far as our problem is concerned, the mag- 
nitude of the gravitational force may be considered to be a constant, that 
is, the same at all places on our meter sticks. We can know that air 
resistance retards the motion of moving bodies but, when heavy bodies 
fall only a few meters, this produces consequences so small that our 
measurements are unaffected. Thus, so far as our measurements can 
tell, we are investigating the motion of a body which moves on a line 
through the center of the earth with only a constant gravitational force 
acting upon it. 

In what follows, vectors are denoted by boldface letters as they usually 
are in printed scientific works.f Study of physics books or the next 
chapter reveals the meaning of the statement that the gravitational force 
F which the earth exerts upon our falling body is meu, where m isa 
positive number (the mass of the body), g is a positive number (the scalar 
acceleration of gravity), and u is a unit vector which lies on the line along 
which our body falls and is directed toward the center of the earth. The 
velocity V and the acceleration a of our falling body are vectors, but they 
are representable in the form v = vu and a = au, where v and a are real 
numbers that are not vectors and are sometimes called scalars to empha- 
size the fact that they are not vectors. Thus v is not a velocity, but it is 
the scalar component of a velocity. We callv a scalar velocity. Similarly, 
ais a scalar acceleration. 

Fortified by at least a hazy understanding of the significance of our 
problem, we use experimental data of a table or of Figure 1.63 to learn 
about the scalar velocity v and the scalar acceleration a of our body. Let 
tı and tz be two different times and let sı and sz be the displacements of our 
body at these times. As the formula 


(1.64) . a = average scalar velocity 
2— ty 


indicates, the quotient on the left is called the average scalar velocity of our 
body over the time interval from the lesser to the greater of tı and tə. 
In case ty < te and sı < sz, the quotient in (1.64) has a very familiar form. 
Except that the units may be different, the quotient is a positive number 
of miles divided by a positive number of hours and hence is a number of 
miles per hour that we normally call an average speed instead of an 


t We pause to observe that boldface letters cannot be conveniently made with pencils, 
pens, crayons, and typewriters, and that a vector F (boldface) is often denoted by F. 
Readers are advised to look at F (boldface) and imagine that there is an arrow on top of it 
so they will, in effect, see the F which they write when they want to emphasize | the fact 
that the symbol is (or represents) a vector. Thus the formula F = ma becomes F = ma 
when it is transferred from printed material to handwritten hieroglyphics. Sometimes the 
arrows are printed to remove the necessity for use of imaginations, but we can, in effect, 
be paid for using our imaginations because printing the arrows increases costs of books. 
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average scalar velocity. After appropriate preliminary topics have been 
studied, Chapter 3 will tell precisely how the velocity v and the scalar 
velocity v at time ¢ are defined. It turns out that the scalar velocity v 


ds oe . 
and a number Te called the derivative of s with respect to ż, are equal to 


each other and, moreover, that the quotient in (1.64) is nearly equal to 


ds , . 
vand J whenever tı = ¢ and ts is nearly equal to t but t: Æ t. ‘To obtain 


an estimate of the scalar velocity v at a particular time t from experimental 
data, which may be presented in a graph, it therefore suffices to calculate 
and use the average scalar velocity over a short time interval beginning 
or ending at t. Use of experimental data for this purpose is rendered 
difficult by the fact that, when żı and #2 are nearly equal, small relative 
errors in measurements can produce huge errors in estimates of the value 
of the quotient (s2 — 51)/(t2 — tı). It is a truly 
remarkable fact that when reasonably accurate 
data are collected and intelligently used, it is pos- 
sible to estimate v for various values of ¢ and to 
find that the points (¢,2) in a tv plane come so close 
to lying on a line that all of the deviations can be 
attributed to errors in measurement and calcula- 
tion. Thus our experimental work leads to the con- 
clusion that, as in Figure 1.65, the graph of v versus? 
Figure 1.65 is either a part of a line or a very close approxima- 
tion to a part of a line. 

The scalar acceleration a of our falling body is defined in terms of the 
scalar velocity v in the same way that the scalar velocity v is defined in 
terms of the scalar displacement s. Thus, in addition to the basic 
formula (1.64), we have the basic formula 


(t1,0;) (2,02) 


3) — e 
(1.66) Soy = average scalar acceleration 
2— t 


in which 2; and vs are the scalar velocities at times ż, and z» The scalar 


. . „do 
acceleration a at time ż and the derivative — are equal to each other and, 


dt 
. . . dv 
moreover, the quotient in (1.66) is nearly equal to a and to 7 whenever 
tı = ¢ and t, is nearly equal to ż but tg = t. On the basis of the assump- 
tion that the graph of v versus ¢ is a part of a line as in Figure 1.65, the 
average scalar acceleration is the slope mı of the part of the line. The 
hypothesis that each average scalar acceleration is the constant m; leads 
to the conclusion that, at each time t, the scalar acceleration is mı; that 
is, @ = m. Calculations from reasonably accurate data show that mı 


is about 980 when centimeters and seconds are used and about 32 when 


1.6 Introduction to velocity and acceleration 45 


feet and seconds are used. This number m; is the gravitational constant 
g to which we have referred. 

The simplest reasonable conclusion that can be drawn from data 
involving falling bodies is the following. To each place on the surface of 
the earth there corresponds a positive constant g, the scalar acceleration 
of gravity at that place, such that when a body moves on a vertical line 
near the surface of the earth with no appreciable external force other than 
the gravitational force exerted upon it, reasonable answers to problems 
can be based upon the assumption that the body is accelerated toward the 
center of the earth and that the scalar acceleration is g. Another similar 
but more lengthy conclusion involves the idea that the graph of v versus ż 
is a line and that reasonable results are obtainable from the formula 
y = gt + vo, where vo is a particular constant that depends upon choice of 
the time-origin used when studying a particular flight. Finally, it is 
possible to use the data and quite primitive mathematics to reach the 
more abstruse conclusion that there exist constants g, vo, and so, thelatter 
two of which depend upon the time-origin and the space-origin used in the 
study of a particular flight, such that reasonable results are obtainable from 
(1.611). A campaign to reach this conclusion can start with the observa- 
tion that the graph in Figure 1.63 does look like a part of a parabola. 

Mathematicians do not, except when they are behaving like physicists, 
actually perform physical experiments. Mathematicians cannot, unless 
they have physical laws or other information upon which proofs can be 
based, prove the formulas that are useful in mechanical dynamics and 
thermodynamics and hydrodynamics and aerodynamics and electro- 
dynamics and economics and psychology and genetics and chemistry and 
cosmology. But mathematicians can, when they are given a few weeks, 
learn enough about derivatives and other things to enable them to start 
with given information and produce more information with astonishing 
ease. One who knows the content of Chapter 3 can start with the first 
of the three formulas 


(1.671) s = T + vo + So 
(1.672) y= i = gt + vo 

du @s 
(1 673) a= m7 E 


and produce the other two as fast as he can write. All he needs to do is 
apply standard rules for writing derivatives. The problems at the end of 
this section provide preliminary ideas about this matter. It is much 
more significant that one who knows the content of Chapter 4 can start 
with the last of the formulas and produce the other two as fast as he can 
write. All he needs to do is apply standard rules for writing integrals. 
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The physical significance of the constants in (1.671), (1.672), and 
(1.673) is worthy of notice. As we see by putting ¢ = 0 in (1.671), sois 
the value of s (the displacement) when ¢ = 0, so so is called the initial 
displacement. As we see by putting ¢ = O in (1.672), vo is the scalar 
velocity when t = 0, so vo is called the initial scalar velocity. It is easily 
seen from (1.672) that v = 0 when t = —20/g. The values (if any) of t 
for which s = 0 can be obtained by putting s = 0 in (1.671) and solving 
the resulting quadratic equation for ¢. In many applications, the space 
and time coordinates are so chosen that the initial displacement so and 
initial velocity vo are both 0. In this case (1.671) reduces to the simpler 
formula 


(1.674) s = $7’. 


The related formulas 
(1.675) t= E v = gt = Vgs, 


which give the time required for the body to fall a distance s and the speed 
attained when the body has fallen a distance s, are often useful. 

We conclude with a remark about uniform circular motion. Supposea 
particle starts at time z = Qon the positive x axis and moves, with angular 
speed w (omega) radians per second, in the positive (counterclockwise) 
direction around the circle of radius R having its center at the origin. 
Letting r denote the vector running from the origin to the particle P 
at time ż gives the first of the formulas 


(1.681) r= R( cos wti + sin wij) 
(1.682) y= wR(—sin wti + cos wij) 
(1.683) a= —w’R( cos wti + sin wf) 


where, as in Figure 1.684, i and j are unit vectors having the directions of 


Figure 1.684 


the positive x and y axes. Application of rules of Chapter 3 then gives 
(1.682) and (1.683) as rapidly as we can write them. Looking at (1.681) 
and (1.683) shows thata = —w*’r and hence that P is always accelerated 
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toward the center. The length of the vector r is always R, and the length 
of the vector a is always w?R. This shows that the magnitude of the 
acceleration is always w?R. These results are important in physics and 
engineering. Physics books that do not make effective use of good 
mathematics do not derive their results so efficiently. 


Problems 1.69 
1 Supposing that g, vo, and so are constants and that 
(1) s = $e? + vot + so 


at each time #, use notation like that in (1.64), so that s = sı when ¢ = żı and 
5 = Sq when £ = fg, to obtain the formula 


(2) S2 — 31, = zele — £) + volt2 — t1) 


and hence 


(3) 


Ay) — Si 1 
— = glt: + t1) + v 
te — fi 


when to Æ t;. Remark: Even though we have not yet encountered procedures by 
which such statements are made precise, we can temporarily accept without 
question the statement that the right side of (3) must be near gt + v% whenever 
tı and żz are both near ¢ and hence that 


(4) v = gt + vo. 
2 Supposing that g and vo are constants such that 
(1) v= gt + vo 


at each time #, use notation like that in (1.66), so that v = vı when ż = żı and 
9 = vo when t = ż to obtain the formula 


(2) V2 — 0, = g(te — tı) 
and hence 

Vo — i L 
(3) no-n 8 


when tz æ t;. Remark: A remark similar to that of the preceding problem is 
applicable here; the scalar acceleration a at time t is g. 

3 We should now be well aware of the fact that Problems 9.29 will appear at 
the end of Chapter 9, Section 2. While the trick is not used in this book, we can 
use the numbers 9.2908 and 9.2922 to identify problems 8 and 22 of Problems 9.29. 
Now comes the problem. Write a single number to identify formula 15 of Prob- 
lem 4 at the end of Section 6 of Chapter 12. Ans.: 12.690415. Persons who feel 
that this trick is complicated should think about the matter to capture some of the 
spirit of members of a research staff of a data processing department of IBM 
(International Business Machine Corporation) who find that such tricks keep 
them in business. 
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in three dimensions 


2.1 Vectors in E, To facilitate discussions and solutions of problems 
in geometry and calculus, and for many other purposes in pure and 
applied mathematics, it is necessary to know about things called vectors. 
All points and vectors with which we are concerned are supposed to lie in 
ordinary Euclid space E; of three dimensions in which such things as 
points, lines, planes, cubes, spheres, and automobiles can exist. The 
definitions of this section do not depend upon a coordinate system and are 


therefore said to be intrinsic definitions. We shall hear more about this 


matter later. ¢ 


Before introducing vectors, we observe the familiar fact that two 
distinct (that is, different) points P, and P2 determine the line PiP2 which 
passes through P, and P, and extends beyond P, and P» in two directions 
as in Figure 2.11. Vectors are more like line segments than like lines. 
An ordered pair Pı, P2 of distinct points, in which P, and P2 are respectively 
the first point and the second point in the pair, determines the vector P,P, 


ce G6 Jo . ° 
or arrow” or directed line segment” which runs (or extends) from the 
48 
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first point to the second point as in Figure 2.111. The purpose of the 
arrowhead is to show that the vector runs from P, to Pe. The vector 
shown in Figure 2.112 is not P,P», but is PoP}. The length (or magnitude) 
\P,P.| of a vector P,P; is the length of the line segment upon which it lies, 
that is, the distance between the points P, and Pz. If P: and P, coincide, 
that is, Pa = P,, the points do not determine a line but they do determine 
the vector PPs, which has length O and which is called the zero vector. 


D 
<9 SP 2 
v 


C P 
P, ! P; 


Figure 2.11 Figure 2.111 Figure 2.112 Figure 2.113 


As indicated in Figure 2.113, vectors are often denoted by boldface 
letters which keep us informed that the symbols represent vectors rather 
than numbers or chemical elements. Thus we can set u = PP; and 
v = P,P.. Two nonzero vectors u and V are said to be equal, and we 
write U = V when, as in Figure 2.113, they (i) lie on parallel lines, (ii) have 
equal lengths, and (iii) have the same (not opposite) directions. Two 
zero vectors Uo and Vo do not have directions, but we say that Uo = Vo 
anyway. If u is a nonzero vector and V is a zero vector, then U ¥ V. 
We use the ordinary 0 (zero) to denote the zero vector; it turns out that 
we will not need an arrow or distinctive type face to tell us whether 0 is the 
number zero or a vector having length zero. The advice given in a the 
footnote on page 41 merits repetition here. Whenever we see F (boldface) 
or any other letter that is boldface, we recognize that it is a vector and 
imagine that there is an arrow above it so that we, in effect, see the sym- 
bols F, u, etcetera, that are made by pencils, pens, and chalk. Thus our 
imaginations convert what we see into what we write, and the disadvan- 
tage of boldface print has disappeared. 

It is both interesting and important to know what is meant by the 
product kv of a number (real number or scalar) k and a vector v and by the 
sum U + V of two vectors. The definitions will imply validity of the 
formula 2u = u + u as well as other useful formulas. In case k = 0 or 
v = Oor both, the product kv is the zero vector, that is, kv = 0. Incase 
k # Oandv = 0, the vector kv is a vector such that (i) v and &v lie on the 
same or parallel lines, (ii) the length of &v is |&| |v|, and 
(iii) V and kv have the same direction if k > 0 and oppo- Figure 2.12 


site directions if k < 0. Figure 2.12 shows examples. 3y 
This definition implies that if v is a nonzero vector, vf f-» 
then the unit vector (vector one unit long) in the direc- -3v 


tion of v is (1/|v|)v or v/Įvl. 
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The sum r of two vectors uU and V is, esr F igure 2-43, the vector which 
runs from the tail of u to the head of v when the tail of v is placed at the 


head of u. The figure shows that 
v+u =u +v. Because the sum 
of two vectors is, as in Figure 2.13, 
the diagonal of a parallelogram, the 


Us rule (or law) for addition of vectors 

a is called the parallelogram law. Fig- 

Wy i ure 2.131 shows the sum r of four 

Figure 2.13 Figure 2.131 vectors Uj, Us Us, Us. In applied 


mathematics the sum of two or more 
vectors is sometimes called their resultant. 


The difference u — v is defined to be the sum of u and —V, so that 


u-v u — v = u + (—v). The most obvious way to find 
u — v is to find —V and add it to u. In substantially 
v 4 all cases, it is quicker, easier, and more useful to observe 


that u — v is the vector which we must add to V to 

obtain the sum u. When the tails of u and V coincide, 

the vector U — V runs from the head of v to the head of u. 
Figure 2.44 It is worthwhile to look at the italicized statement and 

Figure 2.14 until both are thoroughly understood and 
remembered. The figure clearly says that 


(2.141) u=v-+(u-— vy). 


Since angles between vectors can be sources of confusion and mis- 
understanding, we give a little careful attention to the subject. In case 
one or the other of two vectors has length 0, there is no reasonable way to 
determine an angle that should be called the angle between them, and we 
say that the angle is undetermined or undefined. ‘Two sharpened pencils 
of positive length represent vectors in the directions 
of theirsharpened tips. In case these vectors do not 

intersect, we can choose any point O in E; and replace 

the vectors by equal vectors having their tails at O 
asin Figure 2.15. Suppose first that these vectors u 
and v have neither the same nor opposite directions. 
Figure 2.15 These vectors then determine the plane in which they 
lie. The angle 6 is determined by the method used in 

trigonometry to introduce radian measure. The first step is to draw, in 
the plane of the vectors, a circle of radius a with center atO and to find the 


length s of the shorter of the two arcs into which the vectors cut the circle. 
The number 6 defined by 


u 


length of are 


(2.151) radius 


or angle = 


D 
ll 
alu 
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is called the angle between U and V or the angle which u makes with v or the 
angle which v makes withu. Thus angles are numbers.t Ifu and v have 
the same (or opposite) directions, slight modifications of the above con- 
struction give s = O (or s = wa) and the same formula (2.151) is used to 
define 0. In each case we have 0 Ss S ra, and hence O0 < 60S =. 
When working with angles between vectors, we never have to bother with 
“negative angles” and “angles greater than straight angles.” For 
perpendicular vectors, we have 0 = w/2. We, like electronic computers 
and some trigonometric tables, use radian measure and seldom bother 
with degrees, minutes, and seconds. 

The remainder of the text (not problems) of this section gives basic 
information about products of vectors. The importance of the material 
will be revealed later in this book and by textbooks in other subjects in 
pure and applied mathematics. It is not necessary to presume that the 
material is difficult. In fact, students who do not have the good fortune 
to study this material calmly in mathematics sometimes find that their 
teachers in physics and engineering undertake to teach all of it in a few 
seconds. 

There are two different kinds of elementary products of vectors u and v 
that turn out to be interesting and useful. These are the scalar product 
(or dot product) defined by the formula 


(2.16) -u-v = ful |v| cos 6 
and the vector product (or cross product) defined by the formula 
(2.17) u xv = |u| |v| sin ôn. 


These formulas will now be discussed. If u = 0 or v = 0 or both, the 
angle @ appearing in the formulas is not determined by u and v, but the 
products u-v and u x V are defined 
to be 0 anyway. Henceforth, we 
consider cases in which |u| > 0 and 
|v| > 0, these being the lengths of u 
and v. Then, as in Figures 2.18 and 
2.181, the two vectors determine an 
angle @forwhichO S@S 7. Incase Figure 2.18 Figure 2.181 
0 < 0 S x/2, the number |v| cos 6 

is the length of the projection of the vector Vv on the vector U and the 
scalar product is therefore the product of the length of u and the length 


t Dictionaries convey assorted ideas akin to the ideas that an angle is the “enclosed space” 
or “corner” or “opening” near the point where two intersecting lines meet. While we need 
not expect to be injured by conflicting meanings of the word angle, we can use the term 
“geometric angle 8” to signify the “opening” between the two vectors of Figure 2.15. The 
number @ is then a measure of the size of the geometric angle 8, and we have satisfactory 
but somewhat awkward terminology. 
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of the projectionofvonu. Incaser/2 < 0 < r, thescalar product is the 
negative of this number. The definition of u-v implies that wv = 0 if 
and only if cos 0 = 0. Thus u-v = 0 if and only if u and V are orthogonal 
(that is, perpendicular to each other). Those who are or want to be con- 
versant with principles of physics can note that if a particle P moves from 
the tail to the head of the vector u with the constant force V acting upon P 
during the motion, then u-vis the work done by the force during the motion. 

Referring to Figure 2.181, we can see that if u and V are collinear 
vectors (vectors which lie on the same line), then 6 is 0 or r, so sin 6 = 0. 
In this case the vector n of the formula (2.17) is not determined, but 
u xv is defined to be 0 anyway. Henceforth, we suppose that 0 < 
6 <r. In this case, the vector n is the unit normal to the plane of u and 
v which is determined by the right-hand rule. A right hand is so placed 
that the thumb is perpendicular to the plane of u and V and the fingers are 
parallel to this plane and point in the direction that a line rotates in passing 
over the geometric angle 0 from u tov (not v to u). The unit normal n 
is then the vector which has the direction of the thumb and which is one 
unit long. From Figure 2.181 we see that |v| sin @ is the altitude of the 
triangle of which the vectors u and v form two sides. It follows from 
(2.17) that u x v = 2An, where J is the area of this triangle. It must 
always be remembered that the vector product u X V is a vector which, 
when it is not 0, has the direction of the thumb when the right-hand rule 
is applied. Moreover, it is necessary to observe and remember that, 
except when u x v = 0, the vector v X u is not the same as the vector 
u xv. After having found u x v by the right-hand rule, we must flip 
the hand over so that the thumb points in the opposite direction to find 
v XU, and it follows that 


(2.182) Vvxu = —U XV. 


Anyone can attain complete understanding of these matters by making a 
few experiments in which two pencils (representing vectors) are held in 
the left hand while the right hand is used to determine the direction of 
their vector product. While vector products appear infrequently in this 
book, they have many important applications. 

Finally, we call attention to some simple formulas that are easy to use 
but are not so easy to prove. The basic formula, which is proved in 


Problem 17 below, is 


(2.183) (ui + U) (Vi + Veo) = Ure(V1 + Ve) + Ur (Vi + Vo) 
= UrVy + UrV + UzrVı + U2Ve. 


Analogous formulas hold when the parentheses in the left member contain 
sums of more than two vectors. Moreover, correct formulas are obtained 


by replacing the dots by crosses. Proofs of this fact are given in text- 
books on vector analysis. 
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Problems 2.19 


I As in Figure 2.191, let 4, B, C, - - - , H be equally spaced points on the 
line P,P2 with C = Pı and G = P}. Apply appropriate definitions of the text 
to show that 

Pi\D =4P;P, PE = 4$P,P,, P\F =4P\P, PG = PiP; 


eel 


PIA = {PPa PiB=—4P,P, Pid = -PiPPa DE = 4PiP, 


Observe that the vector P,P lies on the line P,P, if and only if there is a scalar 
(or number or constant) A (lambda) such that 


P,P = APP. 


Observe that the points P, and P2 separate the line into three parts and tell what 
values of A correspond to points in the different parts. 

2 Construct a figure similar to Figure 2.191 which 
shows points P, and Pz and also points 4, B, C, D for 
which 


P\A = -PPa  PiB = 4P;P,, 
PiC = $PP», PD = $PP 
3 Let O (an origin), Pi, and Pz be three points in E; Figure 2.191 


with Pı Æ P2 as in Figure 2.192. Verify that if P is a 
point on the line PPa then there is a scalar A for which 
PiP = \P:P2 = AOP: ~ OP;) 
OP = OP, + P,P = OP; + X(OP2 — OP;) 


SO 


OP = NOP. + (1 — NOP.. 
Figure 2.192 
Show that if M is the mid-point of the line segment P;P2, 


then _ _. _. 
OM = 3(OP, + OP.). 


4 Let i, j, and k be mutually perpendicular vectors 
which run along bottom and back edges of a cube as in 
Figure 2.193. Let Pı, Pe, P3, P, be the mid-points of the 
top edges upon which they lie. Show that 


h _1 —=—_ _ 1s 
OP, =7zi +k, OP,=i+3j+k, Figure 2.193 


and write similar formulas for OP;, OP, and OP». 
5 Supposing that the vectors i, j, k of the preceding problem are unit vectors, 
apply the definitions of products of vectors to prove that 


ii=1, ij=0, ixi=0, ixje=k. 


Hint: In each case, write the definition of the product and use the angle correctly. 
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6 Weare going to prove a theorem in geome- 
try. Asin Figure 2.194, let Pı, Po, P3 be vertices 
of a triangle and, for each k = 1, 2, 3, let M, be 
the mid-point of the side opposite Py. For each 
k, let Cp be the point of trisection of the segment 
P,,M,, for which |P,C,| = #1P,M;|. We will prove 
that the points Cı, C2, C3 coincide and we may 
Figure 2.194 put C = Cı = C: = C3. The line segments PM, 

are, in geometry, called medians of the triangle 
Thus, our result shows that the three medians intersect at a point C which trisects 
each of them. For reasons which we shall not now discuss, the point C is the cen- 
troid of the triangular region T bounded by the sides of the triangle. To prove 
our result, let O be any point and show that 


OC, = OP; + $PiM, = OP; + $(OM, — OP:) 
—» 2/0P:+ OP; x=\ OP: +OP: + OP; 
= OP, +5 (2 _ op) = EHEAR, 


The way in which P;, P2, and P; appear in the result can make us feel sure that 
we must have 


OG, = OP; + oe + OP} 


for each k. However, calculate OC; and OC; and show that it is so. 

7 The line segment PC, joining a vertex 
P, of a tetrahedron to the centroid Cp of the 
opposite face, as in Figure 2.195, is called a 
median of the tetrahedron. For each &, let Q, 
be the point of quadrisection of the median 
P,C;, for which |P,Q;| = £|P.C,|. Let O be any 
point. When & = 1, prove the formula 


—— OP, + OP, + OP; + OP, 
Figure 2.195 OO; = ny Se 


and then prove or guess that the formula is valid when & = 1, 2,3, 4. The point 
Q for which 


— OP, + OP: + OP; + OP, 
OQ = AE 


is the centroid of the tetrahedron. Thus the four medians of a tetrahedron 
intersect at the centroid, and this centroid quadrisects each median. 
8 Prove that the line segment joining the mid-points of two opposite edges 
of a tetrahedron contains and is bisected by the centroid of the tetrahedron. ' 
9 Determine whether, in all cases, the two line segments joining mid-points 
of opposite edges of a quadrilateral must intersect and bisect each other. Be 


sure to recognize that a quadrilateral in Es need not have all of its vertices in the 
same plane. 
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10 Prove that if Pi, Pe, Ps, P4 are the vertices of a square having its center at 
C, then 


OG = Pi +OP, + 0P, + OP, 
Hint: For each k ) k we can w write OP, = OC + CP, and notice that something ¢an 
be said about CP, and CP, when n Pr and P, are opposite vertices of the square. 

11 Prove that if Pı, Po, - - + , Ps are the vertices of a cube having its center 
at C, then 


OG = Pi + OP + + + OP, 


12 Figure 2 196 shows eight vectors i, j, —i, —j, i’, etcetera which are unit 
vectors (vectors one unit long) having their tails at the origin of an xy plane and 
having their tips on the unit circle with center at the origin. 
Discover reasons why the first pair of equations 


i+j=vV27 i=— (i — j’) 


~i+j=vV2j’ j= — (ï + j’) 


is valid, and then solve the first pair to obtain the second 
pair. 

13 The vectors u, v, w run in positive (counterclockwise) directions along 
three consecutive sides of a regular hexagon. Express w in terms of u and V 
Hint: Sketch the hexagon and the line segments from the center that separate it 
into six equilateral triangles. Perhaps the simplest observation that can be 
made is u + v + w = 2v. 

14 From the vertices of a triangle, vectors are drawn to the mid-points of the 
opposite sides. Prove that the sum of the three vectors is zero. | 

15 What can be said about the location of Q if PỌ = PA+PB+PC+PD. 
where A, B, C, D are the vertices of a square and P is on a side of the square? 

16 Abilities to sketch figures and construct formulas involving vectors must 
be cultivated. As in Figure 2.197, let u be a unit vector 
(vector of unit length) having its tail at O. Show that vy 
uu = 1. Let v be another vector having its tail at the same y7 
point O. Show that the vector U defined by U = (v-u)u is u 
the vector running from O to the projection of the tip of v 
on the line bearing the vector u. Observe that the vector V Figure 2.197 
defined by V = v — U or by V = v — (vu)u runs from the 
tip of U to the tip of v. Verify that V is perpendicular to U by showing that 


U-V = (v-u)u-[v — (v-u)u] = 0. 


Figure 2.196 


U 


Remark: More opportunities to become familiar with these things will appear 
later. Meanwhile, we can note that we have seen the (or a) standard procedure 
for resolving a given vector v into vector components parallel and perpendicular 
to a given unit vector U. 
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17 Sketch a figure showing four points O, Pı, P2, and Q in Es and suppose 


that (00! = 1. Let Q, and Q: be the projections of P, and P» on the line OQ. 
Show that 


(1) (OP:0Q100 = 001, [P:Pr09100 = 0:0, 
(2) [((OP1 + PiP2)-0Q]0Q = OQ 

and hence that 

(3) (OP; + PiP2)OG100 = (OPO + P:P,00)00 


and therefore 

(4) (OP; + PiP) 00 = OP:00 + PiPr00. 

Remark: If we set u, = OP., uU: = P,P», and v = 00; this shows that the formula 
(5) (ui + U)V = Ury + Uv 


is valid when u; and ux are vectors and v is a unit vector. It follows from this 
that (5) is valid whenever uj, Uz, and v are vectors. With the aid of (5) and 
simpler properties of scalar products, we find that 


(6) (uy + ua) (Vi + Ve) = (Vi + Və) (u: + U2) 
= Vy (U, + U2) + Vor(Ui + U2) 
= (u, + U2)-¥1 + (U1 + U2)? 


and hence 


(7) (u: + u2) (Vi + V2) = Urvi + Urbe + UV) + Ue'Ve. 


This is the basic formula (2.183). 

18 This problem and the next involve some very simple but very important 
ideas. Let r be the vector running from the 
origin to P(x,y), the point P having coordi- 
nates x and y, asin Figure 2.198. Leti bea 
unit vector having the direction of the posi- 
tive x axis. Considering separately the cases 
in which x > 0, x = 0, and x < 0, show that 
xi is the vector running from the origin to the 
Fignre 2.198 projection of P upon the x axis. Then let j 

be a unit vector having the direction of the 
positive y axis and prove that yj is the vector running from the origin to the 
projection of P upon the y axis. Hint: All that is required is appropriate use of 
the definition of the product of a scalar and a 

Figure 2.199 vector. 

19 Asin Figure 2.199, let i and j be unit vectors 
having the directions of the x and y axes of a plane 
coordinate system. Let v be a nonzero vector 
running from the origin to P(x,y). Show that 


(1) V = xi + yj. 


Show that if 6, not necessarily confined to the 
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interval O £ 6 < r, is (as in trigonometry) an angle which the line from O to P 
makes with the positive x axis, then 
(2) v = |v| cos ĝi + |v] sin 6j. 


Let w be the vector obtained by rotating the vector v through a right angle in 
the positive (counterclockwise) direction, so that (as in trigonometry) 0 + 7/2 
is one of the angles which the vector w makes with the positive x axis. Show that 


(3) w = |v] cos (Ø +5)it lv sin (6 +3) 
and hence that 

(4) w = |v| (—sin 6)i + |v| (cos 6)j 

and 

(5) w= —yi + xj. 


Remark: While our present interest lies in vectors, our result is equivalent to 
the fact that, whatever x and y may be, if we start at P(x,y), the point P having 
coordinates x and y, and run in the positive direction along a quadrant of a circle 
having its center at the origin, we will stop at the point Q(—y,x). This fact 
implies and is implied by the formulas 


(6) cos (0 + 5) = —sin 0, sin (6 + 3) = cos 0 


which were used to obtain (4) from (3). 
20 Sketch some figures and discover the circumstances under which two 


nonzero vectors U and v are such that |u + v| = |u — v|. Then prove that 
(1) lu + v|? = (u + v)-(u+v) = uu + 2uv+ vv 

and 

(2) lu + v|? — ju — v|? = 4u-v. 


21 The span of the set of n vectors vi, V2, °° * , Vn is the set of vectors V 
representable in the form 


V = ¢1V1 + Cove. + °° * + Cnn, 


where ¢1, Co, © * * , Ca are scalars. Show that the span of the set of three given 
vectors Vi, V2, V3 is the same as the span of the set of three vectors Wi, Uz, Uz 
defined by the system of equations 


u =v +v 
U: = Vo + Vs 
U3 = Vi + V3. 


Hint: The proof consists of two parts. Suppose first that w belongs to the span 
of Uy, We, Us and seek an easy way to show that w must belong to the span of 
Vi, V2, Vs. It remains to suppose that w belongs to the span of Wi, V2, V3 and then 
show that w must belong to the span of Wi, Uz, Us. As a start, solve the given 
system of equations for Vi, V2, V3. One of the results is 


1 1 
y= su — 7U: + 7U 
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22 Using the definition of Problem 21, prove that if 
(1) €1¥1 + CoVe + cV = O, 


where ci, ¢2, ¢3 are scalars that are not all zero, then some one of the vectors 
Vi, V2, V belongs to the span of the other two. Remark: In this case the set of 
three vectors 21, 22, 23 is said to be a dependent (or linearly dependent) set. In 
case (1) holds only when cı = cp = cz = Q, the three vectors are said to be inde- 
pendent. These concepts are very important in several branches of mathematics. 

23 Perhaps we need a little experience drawing and adding vectors that all 
lie in the same plane. Start with a clean sheet of paper and draw unit vectors 
u and v headed, respectively, toward the right side and top of the page. Let 
Po be the point at the center of the page. More points in the sequence Po, 
Pi, Pa P3, © + + are to be obtained in the following random way. Start with 
k =1. Get two coins of different size and toss them so that each lands H (head) 
or T (tail). 


If big coin is H and small coin is H, let P-P = u. 
If big coin is H and small coin is T, let P, sP, = V. 
If big coin is T and small coin is H, let P,_,P;, 
If big coin is T and small coin is T, let P,_1P, = —V. 


l 
| 
= 


Then draw PoP.. With k = 2, repeat the coin tossing to locate P», and continue 
until Pio has been reached. [Itis not improper to become interested in the proba- 
bility that all of the points Po, Pı, © © - , Pio lie inside the circle with center at 
the origin and radius 5. This is a random-walk problem and such problems are 
of interest in the theory of diffusion. To prepare for investigation of these 
things, we must study analytic geometry, calculus, probability, and statistics. 

24 Using one die (singular of dice, a cube with six numbered faces) instead 
of two coins, describe a procedure for obtaining paths for use in random-walk 
problems in £3. 

25 The problem here is to grasp the meanings of the following statements 
when n is 2 and 3 and perhaps even when x is a greater integer. When Pi, Po, 

> , Payı are n +1 points that lie in the same En, but do not lie in an Eni, 

these points are the vertices of an n-dimensional simplex. A line segment which 
joins twa of these points is an edge of the simplex, so the simplex has n(n + 1)/2 
edges. To each vertex Pą there corresponds the opposite simplex of n — 1 
dimensions having vertices at the remaining points. A median of a simplex is 
the line segment joining a vertex P, to the centroid Æ, of the opposite simplex. 
The n + 1 medians of the simplex all intersect at a point B, and for each &, 


n 


n+ i1 


This point B is the centroid of the n-dimensional simplex and, when an origin O 
has been selected, the centroid B is determined by the formula 


P,B = PA, 


5g  OPi t+ OP: + OPs +--+ + OP ani 


n+l 


Remark: As the assertions may have suggested, simplexes of one, two, and three 
dimensions are, respectively, line segments, triangles, and tetrahedrons. When 
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n exceeds 3, the simplex does not have a plebeian name; it is an n-dimensional 
simplex. 


2.2 Coordinate systems and vectors in E, To locate a point in a 
plane (Euclid space E, of two dimensions), it suffices to have a two- 
dimensional rectangular coordinate system involving the two mutually 
perpendicular x and y axes with which we are familiar. To locate a point 
in Æ; (Euclid space of three dimensions), it suffices to have a three- 
dimensional rectangular coordinate system involving the three mutually 
perpendicular x, y, and z axes of Figure 2.21. To partially overcome the 
difficulties involved in picturing three-dimensional objects on a flat piece 
of paper, we consider the y and z axes to be in the plane of the paper which, 
like a blackboard in a classroom, is vertical and consider the x axis to be 
perpendicular to the y and z axes and sticking out toward us. We can 
also consider the x and y axes to be wires on horizontal fences separating 
rectangular fields and consider the z axis to be a vertical post at their 
intersection. 


P, (3,0,3)¢ 


Q,.(3,0,0) Q o 
P, (3,3,0) 


Figure 2.21 Figure 2.22 


To locate the point P (x,y,z) having nonnegative coordinates x, y, and z, 
we start at the origin, go x units forward (in the direction of the positive x 
axis), then go y units to the right (in the direction of the positive y axis) in 
the xy plane, and then go z units upward (in the direction of the positive z 
axis) to reach P (x,y,z). Ifx < 0, we start by going |x| units in the direc- 
tion of the negative x axis. Similar rules apply when other coordinates 
are negative. Figure 2.21 shows the point P(3,3,3) and, in addition, the 
projections Pz, Py, Pz, Qz, Qy, and Q; of this point on the three coordinate 
planes and coordinate axes. The figure is worth a little study. The 
eight encircled points lie at the vertices of a cube. Each of the edges is 3 
units long, but in the flat figure the distance between two points 1 unit 
apart on the x axis is only a half or a third or a quarter of the distance 
between two such points on the y and z axes. Further information about 
the natures of figures involving rectangular coordinate systems in Æ; can 
be obtained by looking at Figure 2.22. This shows a sphere with center 
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at the origin. The intersection (or section) of the sphere and the yz plane 
is a circle through the north and south poles which could be drawn with a 
compass. The intersection of the sphere and the xy plane is the equa- 
torial circle which appears in the flat figure to be a flattened circle. The 
intersection of the sphere and the xz plane is a circle composed of two 
meridians passing through the poles. The three coordinate planes, the 
xy plane, the yz plane, and xz plane, cut E; into eight parts called octants. 
The octant containing points having only nonnegative coordinates is 
called the first octant, and most people neither know nor care whether the 
others are numbered. 

We can learn about coordinate systems and, at 
the same time, prepare ourselves to solve problems 
of many types in mathematics and other sciences 
by introducing vectors. As in Figure 2.23, let i, j, 
and k denote unit vectors (vectors of length 1) in 
the directions of the positive x, y, and z axes. 
Since these vectors are orthogonal (which means 
Figure 2.23 that two different ones are orthogonal or perpen- 

dicular), normalized (which means that each one 
has unit length), and lie in Æ, (Euclid space of three dimensions), we say 
that they constitute an orthonormal set of vectors in E3. The definition 
of scalar products given in (2.16) implies that 


(2.231) ii=l, jj=1, kk =1 


and that u-v = 0 when U and V are two different ones of the vectors i, j, 
and k. Similarly, the definition (2.17) of vector products implies that 


(2.232) ixi = 0, j xj = 0, kxk =0 

and that 

(2.233) ixj =k, jxk =i, k xi =j 
jxi = —k, k xj = —i, ixk = ~j. 


To help remember these formulas, we can notice that if we write the 
ordered set 


(2.234) i, j, k, i, j, k 


of vectors, then the vector product of two consecutive ones in this order is 
the next but that changing the order of the factors changes the sign of the 
product. A rectangular coordinate system is said to be right-handed 
when the x, y, and z axes are so oriented (or arranged) that their ortho- 
normal set i, j, k of vectors is such that the formulas (2.233) are correct; 
otherwise, the system is left-handed. We shall use only right-handed 
systems so that we can always use the formulas (2.233). 
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As in Figure 2.242, let OP be the vector running from the origin O to the 
point P(x,y,z). The rules for multiplying vectors by scalars and for add- 
ing vectors imply that 


(2.24) OP = xi + yj + xk. 


The three vectors xi, yj, and zk are the vector components of the vector OP, 
and the three scalars x, y, and z are the scalar components.t| We can start 
getting acquainted with scalar products by observing that the angle 
between a vector and itself is 0, so 


IOP|? = IOPI |c IOPI cos 0 


= OP-OP 
= (xi + yj + 2k)-Gi + yj + zk) 
=x + yi +2 


and hence that 
(2.241) IOPI = Vx + y? + z. 


This important formula holds whether x, y, and z are positive or not. 
In case x, y, and z are all positive, we can give another proof of the 
formula by applying the Pythagoras theorem twice to the rectangular 


Figure 2.242 Figure 2.243 


parallelepiped (or brick) of Figure 2.243. Because the angles OQP and 
ORQ are right angles, the Pythagoras theorem gives 


OP|? = (ġie + 0P 
= |OR|? + |RQ|? + |QPI? 
= x? + y? + 2? 
and (2.241) follows. The same methods give the distance formula 
(2.25) [PPa = W (2 — m1)? + Ge — 1)? + (te — a) 


t When physicists talk about the components of a vector, they often mean vector com- 
ponents. When mathematicians talk about components, they usually mean scalar com- 
ponents. Hence the unqualified term “components” is ambiguous. We will have quite a 
bonfire if we burn all the books that tell confusing tales about components and projections 
and directed distances. 
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for the length of the vector PiP, that is, for the distance between P, and 
Pa. This formula will be derived more carefully in the next section. A 
sphere is defined to be the set of points P in E; which lie at a fixed distance 
r (called the radius) from a fixed point Po (called the center). It follows 
from this definition and the distance formula that the equation 


(2.26) (x — xo)? + (y — yo)? + @ — 20)? = 7? 


is the equation of the sphere with center at Po(%o,yo,29) and radius r. 

A paraboloid should be something resembling a parabola, because the 
Greek suffix “oid” means “like” or “resembling.” A paraboloid (or 
circular paraboloid) is defined to be the set of points P(x,y,z) in E} 
equidistant from a fixed point F which is called the focus and a fixed plane 
a which is called the directrix and which does not contain the focus. In 

order to obtain the equation of a paraboloid 
z in an attractive form, we let 1/2k denote the 


T VATN distance from F to r so that 1/2k = p and 
Rote k = 1/2», where p is the length of the per- 


pendicular from F tow. Then we put the z 

E axis through F perpendicular to m and put 
EA the origin midway between F and r as in 
i ——, Figure 2.27. The paraboloid is then the set 
Yy YY of points P(x,y,z) for which |FP| = |DPI, 


Yyg where D is the projection of P on the plane r 
x “lll? 

and has coordinates (x, y, —1/4k). The 
Figure 2.27 present situation is very similar to that in 


Section 1.4, where the equation of a parabola 
was worked out. A point P(x,y,2) lies on the paraboloid if and only if 
\FP|? = |DP| ^and hence, as use of the distance formula shows, if and only if 


1\? 1\ 
e+yt(2- a) -(:+5)° 


Simplifying this gives the more attractive equation 
(2.28) z = k(x? + y?). 


This is the equation of the paraboloid shown in Figure 2.27. 


Problems 2.29 


I Plot the points A(1,1,0), B(1,0,1), and C(0,1,1). 

2 In a new figure, repeat the construction of Problem 1 and insert the 
horizontal and vertical line segments running from 4, B, and C to the coordinate 
axes. 

3 In a new figure, repeat all of the construction of Problem 2. Then insert 
the point D(1,1,1) and the line segments joining D to 4, to B, and toC. Remark: 
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The final figure should look much like Figure 2.21. It seems that we do not 
inherit abilities to do things like this neatly and correctly. A little practice is 
needed, and it often happens that the first figures we draw are very clumsy. 

4 In the xy and xz planes, sketch circles of radius 3 having their centers at 
the origin. Then complete the sketch of the sphere of which these circles are 
great circles, that is, intersections of the sphere and planes through the center of 
the sphere. 

5 A spherical ball of radius 3 has its center at the origin. Sketch the part 
of it that lies in the first octant. 

6 Sketch a rectangular x, y, z coordinate system and observe that, in each 
case, the graph of the equation or system of equations on the left is the entity 
(point set) on the right: 


x= yz plane 
y= xz plane 
2 = xy plane 
x=y= % axis 
x= Z= y axis 
y=z=0 x% axis 
x=y=3z line through (0,0,0), (1,1,1) 


Remark: We make no effort to remember these facts, but whenever we see an 
x, y, % coordinate system, we should be able to observe and use these facts as they 
are needed. 

7 There are many points P(x,y,z) whose coordinates satisfy the equation 
y = 3. Some examples are (0,3,0), (0,3,1), (1,3,0), (1,3,1), and (—40,3,416). 
Sketch a figure and become convinced that the graph of the equation y = 3 is 
the plane m which passes through the point (0,3,0) and is both perpendicular to 
the y axis and parallel to the xz plane. Then, without so much attention to 
details, describe the graph of the equation z = 2. 

8 Plot the points (0,1,0) and (0,0,1) and then draw the line L through these 
points. Show that if P(x,y,z) lies on L, then x = O and y +z = 1. Show also 
that if x = 0 and y + z = 1, then P(x,y,z) lies on L. Remark: It is possible to 
write a single equation equivalent to the system x = 0, y +2 = 1. For exam- 
ple, each of the equations 


kl + ly +z—1| =0 
x+ (yt2—1)2?=0 


does the trick. It is fashionable to keep the two equations, and one who wishes 
to do so may learn something by thinking about the matter. 

9 Put the equation x? + y? + 22 — 2x — 4y + 8z = 0 into the standard 
form (2.26) of the equation of a sphere and find the center and radius of the sphere. 
Hint: Complete squares. Check your result by observing that the coordinates 
of the origin satisfy the given equation and hence that the distance from the 
origin to the center of the sphere must be the radius of the sphere. 

10 <A set S consists of those points P in Es for which | AP|2 + [BP]? = 16, 
where < is the origin and B is the point (0,2,0). Show that S is the sphere of 
radius 4/7 having its center at the point (0,1,0). Sketch the coordinate system 
and S. 
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11 Make a sketch showing an x, y, z coordinate system, the sphere S having 
the equation x? + y? + 2? = 9 and the line L having the equations x = 2, y = 2. 
Find the length of the part of L that lies inside S. Hint: Do not depend upon 
your figure to obtain precise quantitative information. Find and use the coordi- 
nates of the points on the sphere for which x = 2 and y = 2. Ans.: 2. 

12 By use of the distance formula, show that the equation of the set of points 
P(x,y,z) equidistant from two given points P,(x1,y1,21) and Pe(%2,y2,%2) can be 
put in the form 


(1) (x2 — *1) (« — z+) + (y2 — yı) (y — ut as 22) 


+ (12 — n) (z- 2) = 0. 


Remark: Our official introduction to planes in £3 will come in Section 2.4. Mean- 
while, we can observe that if Pı and P2 are distinct points, so that x2 * xı or 
yo É yı Or 22 ¥ Z then the set mentioned above is a plane and the equation 
which we have found is its equation. The equation has the form 


(2) A(x — xo) + Bly — yo) + C(z — z) = 9, 


where A, B, C are constants not all 0 and (%o,'yo,%0) is a point in (or on) the plane. 
13 Supposing that 4, B, C, xo, yo, Zo are constants for which 4, B, C are not 
all 0, show that the equation 


(1) A(x — xo) + Bly — yo) + C(z — z) = 0 


is the equation of a plane. Hint: Taking cognizance of Problem 12, solve the 
equations 


(2) x2 — x1 = A, y2 — yı = B, 22 — 2) = 


xit x: yı ty: _ zı Ze 
(3) 7 = Xo, 7 Yo, 7 


= Zo 
to obtain two distinct points P, and P% such that the graph of (1) is the set of 
points equidistant from P, and P2. 

14 The base of a regular tetrahedron has its center at the origin and has 
vertices at the points (2a,0,0), (—a,v/3 a,0), (—a,— 4/3 a,0). The other vertex 
is on the positive z axis. Find the coordinates of this other vertex. Check the 
result by using the distance formula to determine whether the edges have equal 
lengths. Finally, sketch the tetrahedron. 

15 Determine whether it is possible to multiply all of the coordinates of the 
points (24,0,0), (—a, V3 4,0), (—a,— V3 4,0), (0,0, 4/8 a) by the same con- 
stant A to obtain new points that are vertices of a regular tetrahedron each edge 
of which has length a. 

16 A set of points in Esis called a cone with vertex V if whenever it contains 
a point Po different from V it also contains the whole line through V and Po. 
Each of these lines is called a generator of the cone, the ancient idea being that 
if it moves in an appropriate way, it will “generate” the cone. A cone is called 
the circular cone whose vertex is V, whose axis is the line L, and whose central 
angle is aif V is on L and the cone consists of the points on those lines through V 
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which make the angle œ with L. Supposing that 0 < æ < 7/2, sketch the cir- 
cular cone whose vertex is the origin, whose axis is the z axis, and whose central 
angle is a. Then show that the equation of this cone is z? = k(x? + y?), where 
k = cota. 

17 Sketch a figure, similar to Figure 2.27, in which the paraboloid opens to 
the right along the y axis instead of upward along the z axis. Note that inter- 
changing y and zin (2.28) gives the equation y = k(x? + 2?) of the new paraboloid. 

18 Sketch a figure, similar to Figure 2.27, in which the paraboloid opens for- 
ward along the x axis instead of upward along the z axis. Note that interchang- 
ing x and z in (2.28) gives the equation x = k(y? + z2?) of the new paraboloid. 

19 Plot the eight points (+2, +2, +2) obtained by taking all possible choices 
of the plus and minus signs. Then connect these points by line segments to 
obtain the edges of the cube of which the eight points are vertices. Remark: 
One who finds this problem to be unexpectedly difficult need not be disturbed. 
The problem is unexpectedly difficult. 

20 We embark on a little excursion to learn more about our abilities to sketch 
graphs. The graph in £2 of the equation xy = 1 does not intersect the coordinate 
axes, and it consists of two parts (or branches) that are easily sketched. The 
graph in Ez of the equation xyz = 1 consists of those points in Æ; having coordi- 
nates (x,y,z) for which xyz = 1. The graph does not intersect the coordinate 
planes, and it consists of four parts, namely, the one containing some points for 
which x > 0, y > 0, z > 0, the one containing some points for which x > 0, 
y <0, z <0, the one containing some points for which x <0, y > 0, z < 0, 
and the one containing some points for which x <0, y <0,z2>0. Everyone ~ 
should discover for himself that it is surprisingly difficult (or hopelessly impossible) 
to draw x, y, z axes on a flat sheet of paper and sketch a figure which shows 
what these four parts look like and how they are situated relative to each other 
and to the coordinate system. 

21 Draw the rectangular coordinate system obtained from that in Figure 
2.23 by interchanging the x and y axes and the i and j vectors. Work out the 
formulas for the vector products of these vectors and show that the system is 
left-handed. As a safety measure, make a note on your figure that it is left- 
handed and be sure that your formulas for vector products are not remembered. 

22 It is not necessarily true that our study of mathematical machinery is 
made more difficult when we pause briefly 


to look at a rather complicated appli- z 

cation of it. Figure 2.291 shows a circle cls 

C in the yz plane which has its center at Pp’ 

the point (0,5,0) and has radius a. We Kp > Cc 


suppose that 0 < a< b. The surface $ O f X> | 

T obtained by rotating this circle C about r È, y 
the z axis is called a żorus. Thus a torus PR Yj 

is the surface of a ring or hoop that is “b a | 
more or less closely approximated by an 

automobile tire. As the figure indicates, Figure 2.291 
each point P on the torus T lies on the 


circle which (i) contains a point P’ on C, (ii) lies in a plane parallel to the xy 
plane, and (iii) has its center at a point Q on the z axis. When the angles 6 and 
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ġ are determined by P’ and P as in the figure, we see that 


(1) IQP] = [QP'| = b + a cos 0 

and that 

(2) OP = [QP] (cos ¢i + sin $j) 

SO 

(3) OP = (b + a cos 0) cos pi + (b + a cos 0) sin gj. 

Letting r be the vector running from the origin to the point P on T, we see that 
(4) r = QP + OQ = QP + a sin 6k 


and hence that 
(5) r = (b + a cos 0) cos di + (b + a cos 0) sin ġj + a sin Ok. 


Thus a vector £ having its tail at the origin has its tip on the torus T if and only if 
there exist angles @ and 0 for which (5) holds. Thus (5) is a vector equation of the 


torus. This implies that P(x,y,z) lies on the torus T if and only if there exist 
angles @ and 6 such that 


(6) x = (b + a cos 6) cos 9, y = (b + a cos 0) sin ¢, z=asin l. 


These are parametric equations of the torus, the parameters being ¢ and 9. 
The torus is the graph of the parametric equacions. 

23 Show that the x, y, z equation of the torus of the preceding problem can 
be put in one or the other of the two equivalent forms 


(1) (a? + p2? — x? — y? — z2)? = 4b? (a? — 27) 
(2) (b2 — a? + x? + y2 + z2)? = 4b2(x? + y?), 


Hint: One way to start is to square and add the first two of the equations (6) 
of the preceding problem. Remark: In case 0 £ b £ a, the equations of this 
and the preceding problem are not equations of a torus but they are equations 
of a surface. 

24 Sketch the visible edges of the solid that remains when a cube having 
edges of length a/2 is removed from the upper right front corner of a cube having 
edges of length a. Remark: This figure can easily become a favorite doodle, 
and perfectly normal people can become very much interested in it. 

25 Mr. T., a topologist, claims that a given right-handed rectangular coordi- 
nate system in E can, when considered to consist of three stiff wires rigidly 
welded together at their origin, be moved around in £, in such a way that it will 
coincide with any other such system but cannot be similarly moved into coinci- 
dence with a left-handed system. It is not always easy to understand such 
assertions and*to give proofs of them. It is sometimes easy to wave some arms 
around and give some more or less convincing arguments that could not be called 
proofs. The difficulty here is that those who wave arms and produce more or 


less convincing arguments sometimes reach erroneous conclusions. Anyone who 
wishes to do so may think about this matter. 
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2.3 Scalar products, direction cosines, and lines in E, Letu and v 
be vectors in Æ; having scalar components u1, u2, uz and v1, v2, v3 so that 


= ui + uj + uk 
vi + vj + nK. 


It may be helpful to look at Figure 2.311, which shows vectors u and v 
and the projections of their tips on the xy plane. 


Il 


(2.31) b 


Figure 2.311 


Since the scalar product u-v is defined by the formula 
(2.312) u-v = |u] |v| cos 9, 


it could be supposed that we should find cos @ in order to find u-v. It 
turns out, however, that there is a very simple and useful formula for u-v, 
and we can use this formula to find cos 6 whenever we want it because we 
know that 


(2.313) ul=Val pata, lvl = Vo toh + of. 
We find that 


(uid + uj + uk) (oii + vj + vK) 
uiie (oii + vj + vsk) + uz) (0i + 025 + 03K) + uK (0i + vj + ok). 


With the aid of (2.231), we see that this reduces to the very important 
formula 


uyv 


(2.32) UV = UD + UW + U33. 


In order to help remember this formula, we can remember that the scalar 
product of two vectors is the sum of the products of their scalar components. 


Use of (2.32) and (2.312) gives, when |u| |v| = 0, the formula 


U10 + U02 + U303 


(2.321) cos 0 = ral iv] 


which gives the angle @ between two vectors in terms of the scalar com- 
ponents of the vectors. This formula may be remembered. It is, in the 
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long run, more sensible to concentrate upon the two formulas 


u-v 
Uv 


ju] |v| cos 6 
101 + 202 + u33 


(2.33) 


and to realize that the formula for cos 6 can be obtained very quickly by 
equating the two formulas for u-v. 


When, as above, 


v = mi + vj + ork 


and |v| > 0, the vector in the parentheses in the right member of the 
formula 


Oon, n 
(2.34) v= {wi ( 1+ 25+ok) 


is the unit vector in the direction of v. It is possible, and sometimes 
thought to be useful, to recognize that the scalar components of the unit 
vector are the cosines of the angles a, B, y which the vector v makes with 
the unit vectors i, j,k. This is true because the formulas 


v-i = |v| cos a, v-j = |v| cos £, vek = |v| cos y 
vi = p vj = 0 vk = v, 
imply that 
2.341 21 = cos a, wt _ cos , 23 = cos y. 
CNW Cnn] 


The angles a, 8, y, shown in Figure 2.342, are called the direction angles 
of the vector v. The cosines of these angles are called the directzon cosines 
of the vector. Even those who do not like to prove formulas by use of 
special figures in Æ; should look at Figure 2.342 and see that the formulas 


Figure 2.342 


(2.341) can be proved by use of the formula that defines cosines in terms 
of coordinates and distances. Squaring and adding the members of the 
equations in (2.341) gives the formula 


(2.343) cos? a + cos? B + cos? y = 1 
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which provides an interesting way of making the more prosaic statement 
that the sum of the squares of the numerical components of a unit vector 
is 1. It is possible to put the formula (2.321) in the form 


(2.344) cos @ = cos a, cos a2 + cosf, cos B2 + cos Yı COS Y2, 


where @ is the angle between two vectors making angles a, 6), Yı and 
ae, B2, Y2 with the unit vectors i, j, k. The formulas (2.343) and (2.344) 
are impressive formulas, but persons who know about vectors may hold 
the view that the formulas are antiquated and may concentrate their 
attention upon the formulas (2.33). 

The following numerical example shows an application of the ideas of 
this paragraph. The vector in the left member of the formula 


. 3 4 
2i — 3j + 4k = V2 ( mi- seit a5) 
is the vector running from the origin O to the point P(2,—3,4). The 
length of the vector is the square root of the sum of the squares of its 
numerical components and hence is 1/29. The vector in parentheses in 
the right member is a unit vector in the direction of OP. The fact that 
its scalar components are cosines of direction angles is very often of no 
importance. 
__It is easy to adapt these ideas to obtain information about the vector 
P,P: running from P;(x1,y1,21) to Pe(x2,y2,22) as in Figure 2.36. Starting 
with the formulas 


(2.35) OP, = xi t+ yj +k, OP: = xa + yj + zk, 


it is easy to see that the rules for addition and subtraction of vectors give 
the formulas 


OP; + OP, = (x2 + xı)i + (y2 + yrdj + (z2 + 21k 
OP. — OP, = (xz — xı)i + (ye — yj + (Ze — zı)k. 


But OP; — OP, = P,P», and hence 

(2.351) PiP: = (x2 — m)i + (y2 — yi + (2: — dk. 
Therefore, 

(2.352) — |PiPA = Vr — x)? F (y2 — y)? + (22 — 2). 


If to simplify writing we let d denote this distance between P, and P», then 
we can put (2.351) in the form 


MoT igy ae #7 HK), 


“7 
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The vector in parentheses is then the unit vector in the direction of P,P», 
and its scalar components are the direction cosines of PiP}. 
Before introducing coordinate sys- 
Pi(x1, ¥1) 21) tems, we called attention to the fact 
P(x,y,2) that a point P lies on the line P,P: if 
o Pils, Ye, 22) and only if there is a scalar \ such 
that P,P = \P;P, and 


Figure 2.36 OP = OP, + (OP; — OP)). 


When the points have coordinates as in Figure 2.36, this equation can be 
put in the forms 


(2.361) (x — x)i + (y — yi)j + (z — zı)k 
= A(x: — x1)i + A(y2 — yi)j + A(z: — 21) 


and 


(2.362) x — xı = M(x: — x1), yy —y1 = My: — yı), 
Z2— 2% = A(ze — zı). 


In case x: Æ xı, Yo Æ yi, and Zz: Æ 2%, these equations hold for some À if 
and only if 
(2.37) * TAL YTV 2TH, 

x2 — Xı y2 — yı Z2 — 2 


In case x = xı, the condition on x is to be replaced by the condition 
x = %ı, and similar remarks apply to y2 and zə. With this understanding 
the equations (2.37) are equations of the line P,P, that is, equations that are 
satisfied by x, y, z when and only when P(x,y,z) lies on the line. The 
numbers x2 — %ı, Y2 — yi, Z2 — 2%, are the numerical components of a 
vector lying on the line P,P2, and we know how to find the direction 
cosines of this vector. 
It can be claimed that the equations 


(2.38) Xx- X% _ Yy © 2 


do not look like the equations (2.37) of a line, but we can put these equa- 
tions in the form 


x — x) y — yı Z% — 1 


— —— G ago A 


(xı +a)— xi (yit) y Gtod—4 


which does have the form (2.37). Thus the equations (2.38) are in fact 
equations of the line L which passes through the points P(x1,1,2%1) and 
P2(x1 + a, yı +b, z1 +c). The numbers a, b, c are the scalar com- 
ponents of the vector PiP: on L, and they determine the direction cosines 
of PiP: in the usual way. The equations (2.37) and (2.38) are called 
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point-direction equations of lines because they reveal coordinates of 
points and scalar components of vectors lying on the lines. 


Problems 2.39 


1 Write the intrinsic (not depending upon coordinate system) formula for 
the scalar product of two vectors u and v and be prepared to rewrite it at any 
time. 

2 Write the coordinate-dependent formula for the scalar product of the 
two vectors 


u = al + bj + ek, v= aa + b} + cok 


and be prepared to rewrite it at any time. 

3 Use the results of the first two problems to obtain a formula for the cosine 
of the angle between the two vectors in Problem 2 and be prepared to repeat the 
process at any time. 

4 Find the scalar product of the two given vectors and use it to find cos 9, 
the cosine of the angle between the vectors: 


(a) u = 2i — 3j + tk, v = 21 + 3j + 4k Ans.: cos 0 = $F 
(b) u = 2i — 3j — 4k, v = 2i + 3j + 4k Ans.: cos 8 -= 
(c) u = 2i + 3j + 4k, v = 2i + 3j + 4k Ans.: cos 9 = 

(d) u = 2i + 3j + 4k, v = —2i — 3j — 4k Ans.: cos 9 = —1 
(e) u = 214 3j, v=3i+4ij Ans.: cos 0 = az - i 


5 Determine ¢ so that the two given vectors will be orthogonal (or per- 
pendicular). 


(a) u = 2i — 3j + 4k, v = 2i + 3j + ck Ans.: c 
(64) u=i+jt+k, v=i+j+ ck Ans.: e = — 


6 For each vector v, find the unit vector V, in the direction of v. 


2 3 4 
(a) v = 2i — 3j + 4k Ans: n = 7351 — “7354 + a5 * 


(b) v= 7i Ans: v =i 
1 1 
(c) v=i+j Ans: n = Bit ai 


7 Supposing that v and w are orthonormal vectors and that u = av + bw, 
where a and b are not both zero, find the angle between u and w. Hint: Use the 
basic formulas 


uw = [ul |w] cos 6, u-w = (av + bw)-w. 


Ans.: cos 0 = b/~/ a? + $3. 
8 With the text of this section out of sight, sketch a figure showing points 
(0,0,0), Ps(x1,41,21), and P3(x2,y2,%2). Starting with the assumption that P(x,y,z) 
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lies on the line P,P: and hence that there is a number A for which P,P = APP, 
show how vectors are used to obtain the coordinate equation 


-e -o a 


Refer to Figure 2.36 and the text only if assistance is needed. 
9 Show that the equations 


(1) x — xı = A(x: — xi) y — yi = Ay: — yı), 2-1 = Ae — 321), 


from which the text of this section obtained the equations 


x — xı y — yı Zo Z% 
(2) z y, 


—_— eee D D 


can be put in the form 


(3) x = (x2 a) ta, y= (y2 YDA tyn 2 = Ge — 2) + 21. 


Then show that the equations 


(4) x = aÀ + b, y = aod + bo, z = ast + bs 
can be put in the form 

x— bi y—b _z—b: _ 
(5) | =n = a TN 


Remark: The equations in (1), (3), and (4) are called parametric equations of 

lines; different values of the parameter à yield different points on the lines. It is 

fashionable to use the letter \ for a parameter because people who work with 

Lagrange (French mathematician) parameters (or multipliers) get the habit; 

the letter ¢ is used when time is involved and sometimes when time is not involved. 
10 Look at the equations 


of the line L which passes through the points P;(«1,91,21) and Pe(x2,y2,z2) and tell 
why these equations are satisfied when x = x), y = y1, Z = zı and when x = xz, 
y = Yo, % = zə With this is mind tell how we can quickly find the coordinates 
of two points on the line having equations 


— — — eee 


I1 Find equations of the line L which passes through the points P;(2,—3, —1) 
and P2(1,2,3) and then, after putting z = 0, find the coordinates of the point 
(x,y,0) in which L intersects the xy plane. Ans.: x =4,y = —{, z = 0. 

12 This problem involves a little introductory information that everyone 
should have. The right-handed coordinate system shown in Figure 2.391 is the 
one we ordinarily use. When we are wholly or primarily interested in vectors 
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lying in the same plane, it is sometimes convenient to think of this plane as being 
the plane of the paper upon which we print or write and to use the right-handed 
coordinate system shown in Figure 2.392. Vectors of the form v = xi + yj + zk 
for which z = O then lie in the plane of the paper and figures showing them are 


Figure 2.391 Figure 2.392 Figure 2.393 


undistorted. When we are interested only in vectors lying in one plane, we may 
leave the z axis out of the figure and use Figure 2.393. The introduction is 
finished, and we come to our problem. Use the method, involving slopes, of 
Section 1.3 to show that tan 0 = yg when @ is the angle between the vectors 


u=2i4+3j, v=3i+4j 


running from the origin to the points (2,3) and (3,4). Use the method of this 
section to show (or to show again) that 


18 324 


cos § = —= = 


./325 325 


Then construct and use a modest but appropriate figure to show that the two 
results agree. To conclude with another story, we can remark that the method 
involving slopes may sometimes be preferred because it often gives answers 
without radicals when easy problems in £2 are solved. However, the scalar- 
product method is the more powerful method which works in Æ: and in £3 and, 
as some people learn, in En when n > 3. 

13 A vector v makes equal acute angles ô with the three positive coordinate 
axes. Find 6 (to find cos 6 is enough) (i) by use of an identity involving direction 
cosines and (ii) by using the edges and the diagonals of a cube having opposite 
vertices at (0,0,0) and (1,1,1). Make everything check. 

14 Referring to Figure 2.21, find the angle 0 between the two vectors running 
from the origin to the mid-points of the top edges of the cube that pass through 
P(3,3,3). Ans.: cos 0 = $. 

15 Let v be a nonzero vector and w a unit vector having their tails at the 
same point O. Show with the aid of a figure that the vector (v-w)w is the vector 
component of v in the direction of w, and that the vector v — (v-w)w is the 
vector component of V orthogonal (or perpendicular) to w. Remark: This prob- 
lem appeared among Problems 2.19 with different notation and additional 
information. 

16 When u = 0, each vector V is representable as the sum of a vector com- 
ponent cu and a vector component q orthogonal to u. Find q and find a way 
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of checking the answer when 


(a) u = 2i — 3j + 4k, v = 2i + 3j + 4k 
(6b) u=it+j, v=i 
()u=i+jt+k, v=i 
17 Show that the two vectors 
u, = cos Îi + sin 6j 
u: = —sin ĝi + cos Oj 


constitute an orthonormal set, that is, 
lu,| = 1, u| = 1, and uru: = 0. 
18 Show that the three vectors 


u, = cos ¢ sin ĝi + sin @ sin Oj + cos 6k 
us. = cos ¢ cos Îi + sin ¢ cos ĝj — sin 6k 
Us = —sin di +cos $j 


constitute an orthonormal set, that is, 
u,-U, is 1 when p = q and is 0 when 9 ¥ q. 


19 This problem requires that we learn the procedure by which we obtain 
a useful formula for the gravitational force F which is exerted upon a particle 
P* having mass mı and situated at the point Pi(x1,y1,21) by another particle P? 
having mass mz and situated at the point Po(x2,y2,z2). The very modest Figure 
2.394 can help us understand what we do. We start with the Newton law of 
universal gravitation, which is an “intrinsic law” that does not depend upon coordi- 
nate systems. This law says that there is a “‘universal constant” G, which 
depends only upon the units used to measure force, distance, and mass, such that 
a particle P] of mass m; at P; is attracted toward a second particle Ps of mass 

mz at P2 by a force F having magnitude 


E mm 
P, (1) IF| = 6 => 
F 
m; where d is the distance between the points. Our next 
P, step is to put (1) in the form 
Figure 2.394 (2) IF| = G =. 
[P\P.|? 


According to the Newton law, the direction of F is the direction of the vector 
P,P. (not P.P;). We have learned (and can relearn if the fact has been forgotten) 
that each nonzero vector F is the product of |F|, the length or magnitude of F, 
and the unit vector F/|F| in the direction of F. When F has the direction of 
P,P,, this unit vector is P,P./ IPP]. Therefore, 


(3) F=G Mime PP = Gmm: P,P: , 
[PiP]? |PIP: |P Pal? 


2.3 Scalar products, direction cosines, and lines in E; 75 


Thus when we know about vectors, we can put the Newton law in the following 
much more useful form. There is a constant G such that a particle P¥ of mass 
mı at Pı is attracted toward a particle P* of mass me at Po by the force F defined 
by (3). The problem which we originally proposed involved rectangular coordi- 
nates, and it should now be completely obvious that the answer to our problem 
is 
_ (x2 — x1)i + (y2 — y1)j + (z2 — 2) k 
(4) F = Gmm: [(x2 — xı)? + (y2 — yı)? + (z2 — z1) 5% 


Remarks: Persons who work with these things normally recognize the fact that 
we do not, in our physical world, have “particles”? concentrated at points. Itis 
sometimes possible, however, to obtain useful results from calculations based on 
the assumption that particles are concentrated at points. When this assumption 
has been made, we can complicate ideas and simplify language by replacing the 
concept of “a particle PY at the point P, having mass my’ by the concept of 
“a point P; having mass mı.” It can be insisted that this linguistic antic should 
be explained; otherwise, a serious student of mathematics is entitled to ask where 
the postulates of Euclid provide for the possibility that some points can be 
heavier than others. Finally, it can be insisted that the formulas (3) and (4) 
should not be remembered; it is better to know (2) and to understand the very 
simple process by which we use it to obtain the more useful formulas (3) and (4) 
whenever we want them. 

20 Let Pi(x1,y1,27) and Po(x2,y2,22) be two distinct points in E and recall 
(or prove again) that to each point P(x,y,z) on the line L containing P, and P: 
there corresponds a number J for which 


(1) x = xi +A: — x), y=yeotAe—y1), 2 = z + A(z — z1). 


While we may not yet know why such matters are important, we can observe 
that the equations 

x’ = anx + ary + aigz + bi 
(2) y’ = anx + azy + azz + de 
z = azı% + azo + azaz + bs, 
in which the a’s and b’s are constants, provide mathematical machinery into 
which we can substitute the coordinates (x,y,z) of a point in £ and thereby obtain 
the coordinates (#’,y’,z’) of a point (usually another point) in Es. It is very 
helpful to think of the equations (2) as being a transformer which transforms a 
given point P(x,y,z) into a transform (or transformed point) P’(x’,y’,z’). Thus 
the transformer goes to work on P(x,y,z), which engineers and others call an 
input, and produces P’(x’,¥’,z’), which they call an output. This problem con- 
cerns transforms of points that lie on the given line L. Find, for each A, the 
transform P’(x’,y’,z’) of the point P(x,y,z) on L whose coordinates are given by 
(1). Ans.: The result can be put in the form 


x’ = xı + Alx — x1), y= yi + A(y2 — y1), zl = 21 + Alz — 21), 


where 


V4 

XE = 4%k + ayk + Ai3%~% + bı 
V4 

Ve = Aaix~ + azyk + az3%k + bz 
rA 

Ze = azı%k + a32yk + ass% + b3 
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when k =1 and when k =2. Remark: In case x, = x1, yz = yi» z = z, 
the transforms of the points on L all coincide with the point («1,y1, 21). In case 
xo É x4 OF Yo Æ y4 OF z% £ zı, the transforms of the points on L constitute the 
line L’ containing the distinct points Pi (24544521) and Po(xo,yo,22) and, moreover, 
the transforms of points on L between P and Pz lie on L’ between P} and P}. 

21 Without use of a figure, suppose that @ is a given number for which 
0 < 80 < 7/2 and find and simplify the condition that numbers x, y, z (not all 
zero) must satisfy if the vector 


r = xi + yj + zk 


makes one of the angles 6 and  — 6 with the positive z axis. Ans.: 
= +|r| |k| cos 6 


and z? = c2(x? + y?), where c = cot 0. 

22 Two distinct (different) points Po and P;, together with a number @ for 
which 0 < 0 < 7/2, determine a right circular cone consisting of Po (the vertex) 
and those points P for which the vector PP, makes the angle 0 or m — 0 with 
the vector P,P;. Show that the intrinsic equation of the cone is 


(PoP PoP)? = [PoP] PP]? cos? 0. 


Supposing that Po, Pı, and P have coordinates (%o,¥yo0,20), (%1,41,21), (%,¥,%), and 
that 


(xı — xo)i + (yı — yo)j + (zı — 2o)K = Ai + Bj + Ck, 


find the coordinate equation of the cone. Ans.: 


[A(x — xo) + Bly — y) + Cl — »))? 
= (4? + B? + Cle — xo)? + (y —y0)? + (z — z0)?] cos? 6 


23 The vertex of a right circular cone is at the point (0,0,4), the axis of the 
cone is parallel to the vector i + j, and the lines on the cone make the angle 7/4 
with the axis of the cone. Find and simplify the equation of the cone. Ans.: 
2xy = (z — h)?. Remark: Putting z = 0 shows that the graph in the xy plane 


having the equation xy = h?/2 is a conic section, that is, the intersection of a 
right circular cone and a plane. 


24 Prove that the graphs of the equations 


Ax? + By? + Cz? + Dey + Exz + Fyz = 0 


and 


x? + xyz + y? = 0 


are cones with vertices at the origin. (See Problem 16 of Problems 2.29 for 
information about cones.) 
Figure 2.395 25 Let Pr(xn,ye,%), k = 1, 2, 3, 4, be four given points 
determining two skew (noncoplanar) lines PP and PsP, as 
in Figure 2.395. Many persons, including some who are not 
easily excited, become quite interested in the problem of 
determining a point P’(x’,y’,z’) on P,P. and a point 
P” (x”,y”,z") on PaP, such that the line P/P” is perpendicu- 
lar to both P,P and PsP, Figure 2.395 may seem to be 
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quite mysterious until it is realized that the lines PP and P3P, might be hori- 
zontal, while P'P” might be a vertical line perpendicular to both of them. We 
shall solve this problem with the aid of vectors. The first step is to observe 
that the coordinates of P’ and P” can be written down easily if we find constants 

à (lambda) and u (mu) such that PiP = APP, and P,P” = uPsP. Therefore, 
our real problem is to find À and u. Starting with the formula 


(1) P'P” = PP, + PiP, + PsP” 
we find that 
(2) P'P” = —)dPiP2 + P:P, + PPs. 


The requirement that PP” ‘be be perpendicular to to P,P, and to PsP, i is equivalent to 
the requirement that PP". P.P, = 0 and PP” PP, = 0. This is equivalent to 
the requirement that à and yp satisfy the equations 


(3) APPa PiP: — uPsP¢ P,P, = PiP P:P 
\PiPoPsP. — uPsP¢PsP, = PPs P:P.. 


The question whether these equations have solutions for A and u is now critical. 
Such questions are so often critical that Theorem 2.57 will soon appear. For 
the record, and perhaps for future reference, it can be noted that the determinant 
of the coefficients of À and yp is 


(4) — |P;P.|? |PsP,|* sin? 0, 


where @ is the angle between the lines PPa and P3P4. If these lines are not paral- 
lel, then the determinant is different from 0 and the equations uniquely deter- 
mine A and u. Our skew lines are not parallel. Therefore, A and u and hence 
P’ and P” are uniquely determined. 

26 This problem requires that we pick up the idea that vectors and scalar 
products are not unrelated to problems in statistics. Let be a positive integer 
and suppose at first that n = 3. Suppose n students took examinations in 
English and Mathematics and got grades ¢1, e2 ° °° , én and Mmg, mo,° °°, 
Mn. Let the mean (or average) of the English grades be E and let the mean of 
the Mathematics grades be M. For each k = 1,2, ---,nlet 


(1) up = lk — E, o = mp — M. 


The numbers ui, uz °° * , Un and 21, 9% ** * , % can be regarded as scalar 
components of vectors u and V in Euclid space E, of n dimensions. Show that, 
except in the trivial case in which |u| = 0 or |v| = 0, the cosine of the angle 
between these vectors is determined by the formula 


(2) cos 6 = Uv W101 + ua + °> Hunda 
lul Ivi Ve tat 2 + eVe+e+--- +0 

Remark: It is not difficult to develop enough geometry of £, to show that (2) 
is valid when n > 3 as well as when n = 3. In statistics, the last member of 
(2) is called the correlation coeficient of the English and Mathematics grades. 
Show that if this coefficient is 1, then the vectors u and v must have the same 
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direction and hence that there must be a positive constant A such that og = Au, 
for each & If possible, draw at least one substantial conclusion from this. 
Tell what we would conclude if the correlation coefficient turned out to be —1. 

27 Show how something in the preceding problem can be used to prove that 


lumi + ue + ++ t+ unol S Vai tad + pub Wor toh t o> +02. 


This is the Schwarz inequality. It is both interesting and useful. 


2.4 Planes and lines in E, Planes are important things, and we must 
think about them and the natures of their equations. To start the pro- 
ceedings, we can think of the top surface of a flat horizontal sheet of 
paper as being a part of a plane x. Let P, be a point in x. A vertical 
pencil then represents a vector V which is a normal to the plane. With- 
out bothering to decide how the fact is related to this or that set of 
postulates and definitions in Euclid geometry, we shall use the fact that 
a point P different from P; lies in ~ if and only if the vector P,P is hori- 
zontal, that is, perpendicular to V. Our next step is to apply this idea to 
a plane x, shown schematically in Figure 2.43, which is not necessarily 
horizontal. Let V be a vector of positive length which is perpendicular 
tox and which runs from the origin to a point (4,B,C) not necessarily in x. 
Let Pi(x1,y1,21) be a point in x. A point P then lies in x if and only if 


(2.401) VPP = 0. 


This means that either P = P, or P,P is a vector of positive length which 
is perpendicular to X. Thus a point P(x,y,z) lies in x if and only if 


(2.402) [4i + Bj + Ck)-[(x — x)i + (y — yj + (z — zı)k] = 0 


or 


(2.41) A(x — x) + Bly — y) + C(x — 21) = 9 
or 
(2.42) Ax + By+Cz+D=0, 


where D is the constant defined by D = — Ax, — By, — Czı. Itis easy 
to remember that the equation of a plane 

Figure 2.43 can always be put in the form (2.42), where 
A, B,C, D are constants of which 4, B,C are 

not all zero. Itis not so easy to remember 

z Vø (ABC) that (2.41) is the equation of a plane which 
Da passes through the point (x1,y1,2ı) and is 
PE normal to the vector having scalar com- 
ponents 4, B, C, but this should be done. 
To complete this little story, we must prove 
f that if 4, B, C, D are constants for which 


2.4 Planes and lines in E; 79 


A, B, C are not all 0, then the equation 
Ax + By+Cz+D=0 


is the equation of a plane. In case 4 ¥ 0, we can accomplish the result 
by putting the equation in the form 


4(x-—P)+ Bo -0) +0@-0) =0 


and noticing that it is the equation of the plane which passes through the 
point (— D/A, 0, 0) and is normal to the vector having scalar components 
A,B,C. Incase B #0 or C # 0, the proof is similar. 

The information which we have obtained can be useful in various ways. 
Suppose, for example, we are required to find the equation of the plane r 
which contains three given noncollinear (not on a line) points P1(*1,1,21), 
Po(x2,¥2,22), P3(%3,y3,%3). The obvious way to solve this problem is to use 
the fact that the equation of r must have the form of the first equation in 
the system 

Ax+By+Cz:+D=0 

(2.44) Axı + By, + Cza + D=0 
Ax, + Bye + Cz + D=0 
Ax; + By; + Cz; + D = 0, 


where 4, B, C, D are constants for which 4, B, C are not all 0. Since r 
contains P;, Pe, P3, the remaining three equations must be satisfied. In 
case 4 ~ 0, the equation of r can be obtained by solving the last three 
equations for B, C, D in terms of 4 and substituting the results in the 
first equation. Incase 4 = 0, we must have either B = 0 or C = 0, and 
a similar procedure will work. Except for cases in which some of the 
coordinates of the given points are zero, solving the problem in this way 
can be tedious. The next section will show how answers to this and 
other problems can be expressed in terms of determinants. 

We now look at an interesting procedure which often provides a good 
way to find the equation of a plane mı which contains two given points 
P3(%1,91,%1) and P2(x2,y2,%2) and satisfies another condition. We simplify 
matters by supposing that x: Æ xı, ye Æ yi, and z: Æ 213. We know that 
Pı and P, determine a line L and that the family F of planes r that contain 
Pı and Pz is identical with the family of planes r that contain L. We 
capitalize this fact. Ifa point P(«,y,z) lies on L, then 


x=% YM _~ 323- 
X2 — Xı Y2 — yı LQ — 21 
and hence 
2.45 ,(* 2% _ 274 YT _ *%7TU/\_g 
( ) = Z2 — 21 ty y2 — yı Z2— Z1 
or 


Ate 


(2.451) —~— @ a) +—* o — ) 22 @ n) = 
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where A (lambda) and y» (mu) are constants. Conversely, if A and p are 
not both 0, then (2.45) and (2.451) are equations of a plane containing L. 
To see this, we notice that they have the form 4x + By + Cz + D = 0, 
where 4 and B are not both 0, and that they are satisfied when x = xı, 
y = yi, 2 = 2, and when x = x3, y = y}, 2 = 22. Now we can solve 
problems. Suppose we want to find the equation of the plane x, which 
contains P, and Pz and also a third point P3(x3,y3,z3) not on the line P,P». 
Our answer will be (2.45) or (2.451), and A and u are determined such 
that they are not both zero and the formulas hold when x = x, y = ys, 
z = 23. If the coefficient of A in (2.45) is zero, we can take A = 1 and 
u = 0; otherwise, we can set u = l and solve for A. Suppose next that 
we want to find the equation of the plane r which contains P, and P: and 
is perpendicular to a given plane x’. Our answer will be (2.451) when A 
and yw are determined such that they are not both 0 and a normal to x 
is perpendicular to a normal to x’. Supposing that the equation of x’ is 


Ax + By+Cz+ D =0, 
we find that the normals are perpendicular when 


A 4 uB Atac 


X2 — Xi Yo NV Z2: — Zi 


(2.46) 0. 


It is possible to find values of A and u which satisfy this equation. 
Let d be the distance from a given point P;(x1,91,21) to a given plane x 
having the equation 


(2.47) Ax + By +Cz2+D=0. 


One way to find d is to find the point Po where the line through P, per- 
pendicular to # intersects and then find 
the distance from Peo to Pı. Whether this 
method is tedious or not can be a matter of 
opinion, but it is quite lengthy even when 
A, B,C, D, xı, Yı, zı are given to be nice little 
} integers. With the aid of vectors, we can very 
Py (24691021) easily find d in terms of 4, B, C, D, x1, Y1, 21. 
Figure 2.471 Let P(x,y,z) be any point in x, and let n be 

a unit normal tox. Then, as we can see with 
the aid of the schematic Figure 2.471 in which Po and P are on x and 
P,P. is a normal to r, 


(2.472) d = | iPP,| cos 0, = 'n:PP,|. 


But 
Ai + Bj + Ck 


T VAF BF C PP, = (xı — xi + (yı — y)j + (zı — 2)k. 
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Therefore, 
(2.473) d= | A(x; — x) + Bly - y) + CQ — 2) 


VA + B+ C? 
This looks quite simple but, since P is in x, the equation of r shows that 
—Ax — By- Cz = D 

and we obtain the more useful formula 


Ax, + By, + Czı + D ; 


One who must teach his little sister to start with (2.47) and get (2.48) can 
cook up a new five-step rule: (i) rub out the ‘“=0Q”; (ii) put subscripts on 
x, y, z; (ili) divide by V4? + B? + C3; (iv) stick on absolute-value signs; 
and (v) equate the result tod. It is, as a matter of fact, useful to know 
when and how it is possible to prepare instructions so explicit that routine 
chores can be performed mechanically and can even be performed by 
persons and machines unfamiliar with processes by which formulas are 
derived and combined to accomplish their purposes. 


(2.48) d = 


Problems 2.49 


1 Give geometric interpretations of the numbers xo, yo, 20, 4, B,C appearing 
in the equation A(x — xo) + Bly — yo) + C(z — zo) = 0 of a plane w and be 
prepared to repeat the process at any time. Ans.: See text. 

2 Write an intrinsic (not depending upon coordinates) equation of the plane 
x which contains a given point Po and is normal to a given vector V. Hint: If 
P is in x, then PoP must be perpendicular (or normal or orthogonal) to V. Ans.: 
V-PoP = 0. 

3 How can we derive the coordinate equation of Problem 1 from the intrinsic 
equation of Problem 2? Ans.: Set V = Ai + Bj + Ck and 


PoP = (x — x0)i + (y — yo)j + (2 — zo)k 


so that _. 
V-PoP = A(x — xo) + Bly — yo) + C(z — zo). 


4 Write an intrinsic (not depending upon coordinates) formula for the dis- 
tance d from a point P, to a plane x which contains a point P and is normal to 
a unit vector n. Hint: Construct an appropriate schematic figure and refer to 
Figure 2.471 and formula (2.472) only if assistance is needed. 

5 In each case, find the (or an) equation of the plane x which contains the 
given point and is perpendicular to a vector having. the given scalar components 
(or, in other words, perpendicular to a line having the given direction numbers). 


(a) (0,0,0); 1, 1,1 Ans.:xty+2=0 
(6) (1,1,1); 1,1,1 Ans.cxty +2 = 3 
(c) (1,2,3); 4, 5, 6 Ans.: 4(x — 1) + Sly — 2) + 6(2 — 3) = 0 
(d) (x1,51.%1); 4, B,C Ans.: A(x — x) + By — y) + Cz — 21) = 0 
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6 Because several of the coordinates are zero, it is relatively easy to deter- 
mine 4, B,C, D such that the equation dx + By + Cz + D = 0 is satisfied by 
the coordinates of the three points (3,0,0), (0,4,0), (3,4,5). Do it and thereby 
find the equation of the plane containing the three points. Ans.: 20x + 15y — 
12z — 60 = 0. 

7 A sphere of radius 3 has its center at the origin. Observe that it is not 
easy to sketch a figure showing the plane m tangent to the sphere at the point 
(2,2,1) and the point T where r intersects the z axis. Find the coordinates of T. 
Hint: The plane r is normal to the line from the center of the sphere to the point 
of tangency. Ans.: (0,0,9). 

8 If a, b, and c are nonzero constants, show that the equation 


*~4¥u2%—] 
a b c 


is the equation of the plane which intersects the coordinate axes at the points 

(a,0,0), (0,5,0), and (0,0,c). Find the scalar components of a normal to the plane. 
9 Find the distance from the origin to the plane of the preceding problem. 

Your answer is wrong if it does not reduce to 1/+/3 when a =b =c =1. 

10 A plane mı intersects the positive x, y, and z axes 1, 2, and 3 units, respec- 
tively, from the origin. A second plane r2 intersects each positive axis one unit 
farther from the origin. Would you suppose that 7, and rə are parallel? Find 
the acute angle 0 between normals to the planes. Ans.: cos 0 = +/2916/2989. 

11 Find the equation of the plane which contains the point (1,2,3) and is 
parallel to the plane having the equation 3x + 2y +z— 1 =0. Check the 
answer. 

12 Find the equation of the plane mı which contains the point (1,3,1) and is 
perpendicular to the line L having the equations 


(1) x=t, y =t, z=t+2. 


Hint: If you do not know what else to do, let £ = 0 to get a point P; on L and let 
t = 1 to get another point Pz on L. Then m must be perpendicular to PPa. 


Your answer can be worked out neatly by solving the individual equations in 
(1) for ¢ to obtain 


This shows that a normal tom has scalar components 1, 1, 1 and hence that the 
equation of y is 


(#-1)+% —-3)+@-1) =0 
orx+ty+2=5. 

13 Find the equation of the plane m containing the point Pi(1,1,1) which is 
perpendicular to the line L containing the points (0,1,0) and (0,0,1). Find the 
coordinates of the point P} where L intersects m. Then find [PiP]. Observe 
that the last number (which should be 1/3) is the distance from a vertex of a unit 


cube to a diagonal of a face not containing this vertex. Look at a figure and 
discover very simple reasons why 1 < [P,P,| < 2. 
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14 Determine the value of the parameter àA for which the two planes which 
have the equations 


2x + 3y +4z+5=0 
2x — 3y —dz — 5 = 0 


are orthogonal. Hint: Modest experiments with two sheets of paper enable us 
to capture or recapture the idea that two planes are orthogonal (or normal or 
perpendicular to each other) if and only if their normals are orthogonal. Ans.: 
= t. 

15 If B,C, D are constants for which B and C are not both 0, then the equation 
By + Cz + D = 0 is the equation of a plane m. Show that the vector V; with 
scalar components 0, B, C is normal to m. Show that the vector V2 with scalar 
components 1,0,0, is normal to the yz plane. Show that V; and V: are per- 
pendicular and hence that r is perpendicular to the yz plane. 

16 Consider again the equation By + Cz + D = Q0 or any other equation 
involving y and z but not x. Let us agree (this is an important definition) that a 
set Sı in £3 is a cylinder parallel to a line L if, whenever it contains a point Po, it 
also contains all of the points on the line Lo through Po parallel to L. Use this to 
show that the graph of the given equation is a cylinder parallel to the x axis. 
Solution: Let Po(xo,Y0,20) be any point on the graph of the given equation. Then 
the numbers xo, Yo, Zo satisfy the equation. Since x does not appear in the equa- 
tion, it follows that the numbers x, yo, Zo satisfy the equation for each x. This 
means that all of the points (x,o,z0) on the line Lo through (%o,yo,20) parallel to 
the x axis lie on the graph. Therefore, the graph is a cylinder parallel to the x 
axis. 

17 Supposing that B, C, and D are constants for which B and C are not both 
0 and D = 0, show that there is no number x for which the numbers x, 0, 0 
satisfy the equation By + Cz + D =0. What is the geometric significance of 
this result? 

18 Look at the equations 


—— oe è ë ë D è 


of the line containing two points P1(x1,91,21) and P2(x2,y2,32). Describe completely 
the graph of the equation obtained by deleting one of the members of this equality. 
Sketch the three graphs obtained in this way. 
19 Problem 12 of Problems 2.29 is of interest here. Solve the problem again 
and think about it. ; 
-20 Take a good look at the equation 


A+ 


A H E an) =0. 
-i 


(x — xı) + (y — yı) — 


XQ — X1 y2 — Yı 22 


(1) 


Supposing that P,(x1,y1,21) and Po(%2,y2,%2) are two points for which x: ¥ %1, 
yo É yı, Z2 É zı and that À and u are numbers not both zero, tell why (1) is the 
equation of a plane containing Pı and Pa Then study the text again and attain 
a better understanding of matters relating to (1). : 
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21 Supposing that 4, B, C are not all zero and that De Æ Dy, let mı and m3 be 
the planes having the equations 


(1) Ax + By +Cz+ D =0 
(2) Ax + By + Cz + D: = 0. 


Show that the planes have parallel normals and hence that the planes are parallel. 
Show again that the planes must be parallel by showing that they have no point 
of intersection; if the coordinates of a point P(x,y,z) satisfy (1), they certainly 


cannot satisfy (2). Supposing that A and u are constants not both zero, show 
that the equation 


(3) (Ax + By + Cz + Dı) + p(Ax + By + Cz + D) =0 


is the equation of a plane parallel to mı and mz unless A + u = 0. Supposing 
that Po(xo,yo,%0) is a given point, show that A and u can be so determined that the 
graph of (3) contains Po. Solution of last part: The graph of (3) will be a plane 
containing Po if and only if A + u Æ 0 and 


(4) N(Axo + Byo + Czo + Di) + w(Ax0 + Byo + Czo + De) = 0. 


Since Dı Æ De, the coefficients of A and yp are different numbers that we can call 
E and F. We can then put (4) in the form 


(5) NE + uF = 0 


and obtain a solution of our problem by setting A = F and u = —E because, in 
this case, A + u = F — E ¥ 0 and (4) holds. 
22 Supposing that the graphs of the equations 


(1) Ax + Biy + Cız + D: = 0 
(2) Ax + Bay + Cx + De = 0 


are distinct (that is, different) planes mı and 7, that intersect in a line L and that À 
and u are constants not both zero, show that the equation 


(3) Allix + By + Ciz + Di) + w(Aox + Bay + Ca + De) = 0 


is the equation of a plane m containing the line L. Solution: It is clear that if 
P(x,y,z) lies on L, then the coordinates of P satisfy both (1) and (2) and hence (3) 
To prove that (3) is the equation of a plane, we must prove that the three 
equations 


AA, + pA» = 0, AB, + uB: = 0, AC, + uC = 0 


cannot be simultaneously satisfied when \ and y are not both zero. This matter 


is more delicate. If we suppose that the three equations are satisfied and \ ¥ 0, 
we find that 


A, = (—p/\)4,, Br =(—u/A)B, Ci = (—p/AMCr 


and obtain a contradiction of the hypothesis that mı and me are not parallel. 
The case u 0 is similar. 


23 Using the hypotheses and equations of the preceding problem, show how to 
determine A and u such that (3) will be the equation of a plane r containing L 
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and a given point Po(%o,yo,%0). Hint: Consider separately the case in which Po 
is on L and the case in which Pp is not on L. 
24 Let m and T: be the planes having the equations 


2x + 3y + 42-—5 =0 


(1) x — 2y +3z—4=0 


Verify that mı and m2 do not have parallel normals. This implies that rı and r2 
must intersect in a line L. Observe that a point P(x,y,z) lies on L if and only if it 
lies on both mı and mz and hence if and only if its coordinates satisfy both of the 
equations (1). We should be able to find point-direction equations of L by 
finding the coordinates of two points P, and P2 on L and using them. Do this by 
finding x and y such that the point (x,y,0) lies on L and then finding x and y such 
that the point (x,y,1) lies on L. We now develop a simpler and more interesting 
method for finding equations of L. Show that if P(x,y,z) lies on L, then 


(2) 7x +172 — 22 =0. 


Observe that (2), the result of eliminating y from the equations (1), is obtained 
by multiplying the equations (1) by 2 and 3, respectively, and adding the results. 
Observe that (2) is the equation of the plane which passes through L and is 
perpendicular to the xz plane. Show also that if P(x,y,z) lies on L, then 


(3) 7y —22+3 =0. 

Discuss this matter. By solving the equations (2) and (3) for z, show that 
7x —22  7y+3 _ 

(4) 27 =" 2” =? 


and that these are equations of L. Observe that dividing these equations by 7 
puts them in the point-direction form 


25 Find the equation of the plane m which contains the point (1,3,1) and the 
line L having the equations 


(1) x =4, y =1, Z=t+ 2. 
First solution: The equation of m has the form 

(2) A(x —1) + Bly — 3) 4+ Ce —1) =0. 
Since r contains the given line, we must have 

(3) Alt —1) + B¢ —-3) + CE¢4+1) =0 
foreach ¢. Putting ¢ = 3 and then ¢ = 1 shows that 

(4) 24+4C=0, —2B+2C=0. 


Solving these equations for 4 and B and putting the results in (2) gives 
(5) —2C(x — 1) + Cly — 3) + Cz — 1) = 0. 
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Dividing by C and multiplying by ~1 gives 


(6) 2(@— 1) —-& — 3) -@—-1) =0 
or 
(7) 2x —y—z+2=0. 


Second solution: Eliminating ż from the first two and then from the first and last 
of the equations (1) shows that L is the intersection of the planes having the 
equations 


(8) x—y =0, x—-z+2=0. 
Each plane containing L has an equation of the form 
(9) A(x — y) + u(x — z + 2) = 0, 


where A and u are constants. The plane having the equation (9) contains the 
point (1,3,1) if and only if —2A + 2u = 0. Putting u =X gives the required 
equation 


(10) 2x—y—z+2=0. 


26 There are nontrivial applications of the idea that a point which lies in 
each of two nonparallel planes must lie on their line of intersection. Show that 
the equation 

ey o] 
a'b ` 


will be satisfied if, for some constant \, the two equations 


iial (2-2) -1-3 


both hold. Try to determine a geometrical interpretation of this result. Try 
to obtain another very similar result. 
27 Let Land L’ be the lines of intersection of the planes having the equations 


x, &% y x , 3 y 
+2 =r(1+3) *+2=u(1-3) 
L 2 K , b p 2 N , b 
a-i) =i} ($ -5) -1+7 
Work out the point-direction forms 
2a (1 — d2)d 
L ~~ i+ % T+ 2-0 
—(1 — d2)a 2rd ~ (1+ r2)c 
_ _2pa (1 — p?)d 
r. *"T+e 7) T+ _ 2-0 
—(1 — p’)a —2pb (1+) 


of equations of L and L’. 
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28 What can be said about the tip of the vector OP if 
OP = c:0P; + c:0P + csOPs, 


where ¢1, C2, ¢3 are scalars for which cı + c2 + c3 = 1? Ans.: It lies in the plane 
(or each plane) which contains Pı, P2, and P3. 


2.5 Determinants and applications Rectangular arrays of elements 


such as 
au aw G11 412 ALB Qi 
(2 a), G21 422 Az 421 
231 432 433 431 
are called matrices. For the present we may think of the elements a,x as 
being numbers. The middle matrix has three rows, the elements of the 
second row being @a1, 422, a23, and three columns (columns are things that 
stand in vertical positions), the elements of the third column being 
413, 293, 433. A matrix is square if it contains as many columns as rows, 
and in this case the number of rows is called the order of the matrix. 
With each square matrix we associate a number which is called the 
determinant of the matrix or simply a determinant. The symbols Ag, Ag, 
and A, appearing in 


an Q141 Ayo Aig 
2.61 A, = Ain A A. = 
( ° ) 2 = , ? 3 = |@o1 G22 4231, 
21 422 
- ! A31 432 433 
; 411 412 413 Au 


A21 Qaa? A23 A24 
Ag = 

431 4&32 433 434 

A41 Q42 Gag aa 


are numbers, not matrices. It will, however, be a convenience to say 

that az: is the element in the third row and second column of the deter- 

minant A; instead of saying that it is the element in the third row and 

second column of the matrix of which A; is the determinant. A little time 

spent learning about determinants can pay very handsome dividends. 
The number A, is defined by the formula 


(2.52) Ae = 411422 — 412421. 

This shows how to evaluate determinants of order 2. For example, 
1 0 
0 1 


a b 


a 10. 


| 
> 
| 
A~ 
| 
ON 
Ne 
Il 


= ab — ab = Q, p 


= 1, 


The definitions of As and A, are more complicated, and we introduce 
helpful words and notations. To each element a; of a determinant 
there corresponds the minor A; which remains after the row and column 
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containing a;, have been covered or removed. Thus for the determinant 
A3, we have 


A22 A23 A21 423 Q11 443 
(2.53) du = > Ap = , An = 
A32 433 A31 433 A31 433 
and for the determinant A, we have 
A22 A23 G24 A21 Q23 A24 411 @Q12 Aja 
Ay = | 232 33 4341, Aw = | A31 433 4341], Ao = | 431 432 34 | 
A42 G43 A44 A41 443 44 A41 @a2 Aaa 


While it is possible to give more gruesome definitions, the number A, 
can be defined by the formula 


(2.54) A; = anuli — arfi + 413413 
or 
A22 A23 A21 ~ 423 21 422 
A3 = 411 — A12 + 413 ? 
Q32 433 431 433 431 432 


which makes sense because we know how to evaluate determinants of 
order 2. The above formulas give the expansion of A; in terms of the 
elements of the first row. It can be proved that the same number A; is 
furnished by expansions in terms of another row or any column. Thus 


A3 — 21401 + a242 — d23A03 
. A3 A31431 — 32432 + 433433 
A; = 411411 — azán + 431431 
A3 = —@12412 + a242 — a32432 
A; = 433413 — az423 + 33433. 


In these expansions, the sign of the term involving a, is plus whenever 
the sum (or difference) of the subscripts is an even number like 0, +2, 
+4, -+ and is minus when the sum (or difference) is odd like +1, 
+3, +5,-+-+. To put this in other words, we can say that we get 
a plus sign whenever aj, lies on the main diagonal (running from the upper 
left corner to the lower right corner), and that we get a change in sign 
whenever we move one step right, left, down, or up. 

Progress to determinants of order 4 and more is now easy. The 
expansion of A, in terms of the elements of the first row is 


(2.55) Ag = an 4 — 412419 + 413413 — auu 


and A4 has seven more expansions, in terms of the other rows and the 
columns, all of which yield the same number Ay. The usefulness of 
the possibility of expanding a determinant of order 4 in terms of elements 
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of the third row is demonstrated by the expansion 


2 3 —-1 6 


3 1 6 
l a g 2j=3| 3 7 —2|+0 
1 5 4 


1 —1 5 4 


which reduces the problem of evaluating a determinant of order 4 to 
the problem of evaluating a single determinant of order 3. 

It should be known and remembered that a determinant is 0 if two 
of its rows (or two of its columns) are identical. For determinants of 
order 2, this is obvious because 


i |= ab — ab =0, p p ab ~ ab = 0. 
That 
abe ada 
a b c|=0, be b\|=0 
d e f c f c 


can be seen by expanding the first determinant in terms of elements of 
the bottom row and by expanding the second in terms of elements of the 
middle column. When the result has been established for determinants 
of order 3, the same trick enables us to establish the result for determi- 
nants of order 4, and so on. A simple modification of the above pro- 
cedure shows that if two adjacent rows (or columns) of a determinant 
are interchanged, then the value of the determinant is multiplied by —1, 
that is, the sign of the determinant is changed. It is sometimes useful 
to know the formulas 


ai ap a3 Qi aq 43 
(2.56) kbi kbo kb; = k bi bo b3 
C1 C? C3 C1 Co C3 
Qi ap a3 Qi A 43 Qa, a 43 
(2.561) [di ter bz+ c2 bytes} = jd: be bs} tle c2 c 
dy də d3 dı dz ds dı də d3 


and others like them. They can be proved by expanding the determi- 
nants in terms of elements of the middle row. The results are particu- 
larly useful when, for some constant k, we have cı = kdi, co = kdz, and 
c3 = kd3. In this case the last determinant in the above formula is zero 
and we obtain the formula 


Qy a2 a3 4i a2 4&3 
(2.562) bi + kdı bo + kde b3 + kd; = bi be b3 ° 
dı də d3 dı dz d3 
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What this and similar formulas say is that we do not change the value 
of a determinant when we add a constant multiple of the elements of one 
row (or column) to the elements of another row (or column). For 
example, we obtain the first equality in 


2 —3 1 2 1 1 2 1 —5 
1 —2 3;=/;1 0 3;=);1 0 0 
3 3 —5 3 9 —5 3 9 —l4 


by adding 2 times the elements of the first column to the elements of the 
second column, and then we obtain the second equality by adding —3 
times the elements of the first column to the elements of the last column. 
As we have seen, this reduces the problem of evaluating a determinant 
of order 3 to the problem of evaluating a determinant of order 2. 

The following two theorems, and their obvious modifications involving 
systems containing two or more than three equations, are very important. 

Theorem 2.57 The system of equations 


11%. + aix2 + a13%3 = yı 
AX, + Aor, + a23% = Veo 
31%. + a3z%2 + a33x%3 = V3 


has a unique solution (is satisfied by one and only one set of numbers 
%1, X2, %3) tf and only 1f 

@11 412 ALR 

Ao, 422 ao3| = 0, 

231 232 4&33 


that is, the determinant of the coefficients is different from Q. 
Theorem 2.58 The system of equations 


411%ı + AyexXe + a13%3 = 0 
azı% + a22%2 + Ao3x3 = 0 
A31X%1 + az2%2 + A33x%3 = 0 


has a nontrivial solution (a solution for which xı, x2, x3 are not all 0) if 
and only if 

Q11 12 413 

42, az 423} = 0, 

231 232 433 


that is, the determinant of the coefficients is 0. 


Proofs of these theorems belong in books and courses in algebra, but 
everybody can observe that the first system of equations 


2x1 + Ax = 8 2x + 4x = 8 2x + 4x = 


0 
xi — 2x. = 1, xı + 2x: = 0, xı + 2x2. = 0 
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has a unique solution, the second system has no solutions, and the third 
system has many solutions including the nontrivial one x; = 2, x = —1. 
Partly because of these two theorems, determinants are important. 
Determinants were originally devised to speed the process of solving 
systems of equations whose coefficients are given in decimal form. It is 
sometimes said that young algebra students should not be taught to 
solve systems of equations by use of determinants because the method 
is inefficient and yields too many errors; the method of successive elimi- 
nations is much better. This argument is vulnerable, because students 
who solve systems of equations by use of determinants acquire facility 
in use of determinants. In this course, it is recommended that deter- 
minants be used only for puiposes for which they are useful. 


Problems 2.59 
1 Show that 
AB €c 
1 2 —1|= 144 — 8B = 2C. 
3 4 5 


2 Supposing that Pi(%1,y1) and Po(x2,y2) are fixed points in E» show that 
the equation 


x y 1 

(1) xı yı 1]|=0 
X2 Y2 1 

has the form 

(2) Ax+Byt+C=0 


and that the graph contains Pı and Pe Comment upon the result. Solution: 
Expanding (1) in terms of the elements of the first row gives (2). The equation 
(1) is satisfied when x = x, y = yı and when x = x» y = ye because in each 
case the determinant has two identical rows. The equation (2) is the equation 
of a line unless 4 = B = Q, that is, unless P, and P; coincide. If Pı and Pz do 
coincide, the equation (2) becomes Ox + Oy + 0 = 0 and the graph is the whole 
plane. 
3 Letting A be the first determinant in the formula 


x y z 1 x— x% yoy 2-2 0 
Hoy a Ll} _t om yı zı 1} 
%2 Y2 %2 l X2 — Xi Y2 — Yl 22 — ZI 0 
x3 ys 23 l xs — xi ys — y z—z 0 


show how the formula can be obtained and show that 


x— x y—y 2-2 
A =j|x:— x% ye- y te at 
x3 — %1 Y3 — Yı 23 — ZI 
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Now write formulas for the coefficients 4, B, C in the expansion 
A = A(x — xı) + Bly — yı) + C(z — zı). 


Show that the graph of the equation A = 0 contains the three points P,(«1,41,21), 
Po(x2,y2,%2), P3(x3,¥3,23). Comment upon the result. 

4 Let |T| denote the area of the triangle 7 having vertices at the points 
P,, Po, P of Figure 2.591. With an eye on the figure, discover a way in which the 
formula 


(1) iT] = BES Ge m) + 2A ey — a) = EE — a) 


can be obtained. After expanding the products, show that some of the terms 
cancel out and that the formula can be 

P(x,y) put in the form 

(e) 


Po(x2,¥2) x y 1 
(2) IT] = +4 “1 Y1 1 
x2 Ye 1 


with the plus sign. It can be shown that 
(2) is correct with the plus sign when the 
vertices P, P;, Pz occur in positive (coun- 
terclockwise) order, and that (2) is cor- 
rect with the minus sign when the vertices P, P;, P2 occur in negative (clockwise) 
order. The members of (2) are 0 when the points are collinear. Many people 
remember this. 

5 This and the next two problems, together with Problem 11 at the end of 
the next section, show that if V is the volume of the tetrahedron (or simplex) 
in EB; having vertices (x,4,2), (x19121), (22,92,22), (23,93,23), then 


Figure 2.591 


where the sign is chosen such that V > 0 (or V = Oif we allow degenerate tetra- 

hedrons to be called tetrahedrons). Verify that this formula is correct with the 

negative sign for the special case in which a, b, c are positive constants and the 

four vertices are, in order, (0,0,0), (2,0,0), (0,5,0), and (0,0,c). Hint: Remember 

or learn that the volume of a simplex (tetrahedron) in Æ; is one-third of the 

product of the altitude (number, not line segment) and the area of the triangular 
ase. 


6 Letting D be the determinant of the preceding problem, show that 


x y % 1 x y z 1 

D= -| 7% TM =a ll jx-m yy z=% 0 
—~X2 ~y? —Ze —] x — x2 Y— V2 2 Ze 0 

~x; —y; —23 —1 x~ xX; y— ys z2—23 0 
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and hence that the formula for the volume of the tetrahedron can be put in the 
form 
x — xi YTV 27—32 
V = +4 xX — x2 Y—Y2 23> 2| 
x — %3 Y—Y3 2— 4 


7 Three vectors u and v and w with scalar components u1, ue, u3 and 91, 22, 93 
and wi, we, W3 have their tails at the same point P and are edges of a tetrahedron 
having volume X. Show that the formulas for V in the two preceding problems 
can be put in the form 


uUi ug 3 

1 
y = +g vı V2 93 l 
Wi W2 W3 


8 Supposing that Pi(x1,y1), Po(x2,y2), and P3(x3,y3) are three noncollinear 
(not on a line) points in Ee, show that the equation 


x+y? x y 1 
xi + yi x1 Yi l — 0 
xs H yo x2 yo l 
x3 +33 xs ys 1 
has the form 


A(x? + y?) + Bex +Cy+ D=0 


and that the graph contains the three points. Comment upon this result. 
9 Show that if P,(x1,y1), © © © , Ps(xs,ys) are five different points in £2, 
then the equation 
x xy y x y 
xi xyi yi %ı Jı 
X2 X2 Yo %2 y2 
X3 X33 Y3 %3 Y3 
xq Xa Yå X4 y4 
x5 xs Vs Xs Ys 


wo 
ve 
ee ee ee pů 


has the form 


Ax? + Bey + Cy? + Dx + Ey + F = 0 


and that its graph contains the five points. 

10 Use the ideas of the above examples to obtain the equation of a sphere in 
E; which contains (or passes through) an appropriate collection of given points. 

11 Let Py(x1,1%1)) © © © , Ps(x5,y5,%5) be five given points (the coordinates 
are supposed to be known numbers) such that no four of them lie in the same 
plane. Tell how to decide whether the line L containing Pi, Pais parallel to the 
plane r containing Ps, Ps, Ps and tell how to find the point of intersection when 
L intersects m. Solution: There are many ways to attack this problem. The 
following method gives answers in terms of the known coordinates with relatively 
little calculation. A point P(x,y,z) lies on the line L if and only if, for some con- 
stant À, 


(1) £ =x + Al- x), y =y +My y) 8 =m + Mas — 21). 
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This point P lies on w if and only if D = 0, where 


xı + A (x2 — xı) yı + A (y2 — y1) zı + A (z2 — z1) 1+0 
_ %3 V3 23 1 
2) D= x4 y4 %4 1 
X5 Ys 25 1 
But 
(3) D = D, + AD, 
where 
Xi yı Zi 1 X2 — Xi Y2 — Yı Z2 — By 0 
_ | *3 Y3 43 l _ X3 y3 23 , 
(4) Dı = x4 ya u 1P D: = X4 y4 Za 1 
xs ys %5 l %5 Js 25 1 


Now D, #0 because P;, P3, Ps, Ps do not lie in the same plane. In case 
Dz = 0, there will be no A for which D = 0 and hence no point P on L which lies 
in r, so L and z must be parallel. Incase D: ¥ 0, there will be exactly one X for 
which D = Q0, that is, A = —D,/D. Putting this in (1) then gives the coordi- 
nates of the point of intersection of L and =. 

12 Find out whether the plane containing two vertices of a tetrahedron and 
the mid-point of the opposite edge must contain the centroid of the tetrahedron. 

13 ‘This story can be read by anyone. It interests nearly everyone, but stu- 
dents who do not have a lot of time at their disposal can postpone the pleasures 
and benefits enjoyed by those who fully understand it. Matrices are often 
denoted by single letters. For examples, we can set 


2 1 3 3 1 -2 7 4 —6 
(1) P= (-: 2 -1) Q= (i —1 i) R= (-7 —6 ə) 
1 —1 1 0 1 —i 2 3 — 
1 7 6 1 —1 —3 
S = ( 6 —2 s) T= (2) U = ( 2} V = ( e) 
—4 3 —2 3 —1 —4 


We shall, among other things, develop enough algebra of matrices to enable us 
to understand and verify the fact that these matrices satisfy the conditions 


PQ=R QP=S PQ#QP, 
(2) QT=U, PU=V¥, P(QT)=V¥, (PQTH=V 
det (PQ) = det (QP) = (det P)(det Q). 


If 4 is a square matrix, we can denote its determinant by det 4. Two matrices 
4 and B are said to be equal, and we write 4 = B, if they (i) have the same num- 
ber of rows, (ii) have the same number of columns, and (iii) have equal elements 
in corresponding positions. We can multiply matrices by scalars (numbers), 
and we can add two matrices which have the same number of rows and the same 
number of columns. For example, when P and Q are the matrices displayed 
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above and & is a scalar, we have 


2k k 3k 5 2 1 
(3) kP = | —3k 2k -+) P+Q= (- 1 o) 
k —k k 100 


When we multiply a matrix by a scalar k, we multiply each element by k. When 
we add two matrices, we add them elementwise. These rules are very different 
from those applicable to determinants. The really crucial step in the develop- 
ment of a useful algebra of matrices is the determination, in terms of two suitable 
matrices 4 and B, of a third matrix C which we shall call the product 4B of 4 
and B. Let 4 have n rows and p columns and let B have g rows and n columns, 
so that the number of rows of 4 is the same as the number of columns of B. 
The product 4B is then a matrix C having p rows and g columns, the element 
c, in the jth row and kth column of C being determined by the formula 


(4) Cik = QD 1% + A, 2b 0 treet QsnDnks 


where 4,1, 2,2, ° * © , @ are the elements of the jth row of 4 and by, by, © © © , bnk 
are the elements of the kth column of B. To demonstrate that applications of 
this ritual are not fearsome, we let 4, B, C be the matrices P, Q, R defined above 
and see how the result 


2 1 3\ /3 1 —2 7 4 —6 
(5) —3 2 —1lj}il —1 1) ={ —7 —6 | 
1 —-1 1/ \0 1 —1 2 3 —4 


is obtained. To obtain the element 7 in the first row and the first column of the 
last matrix, we run one finger across the first row of the first matrix, and at the 
same time, run another finger down the first column of the second matrix and 
(with regret that we do not have three hands) write the sum 


(6) 23411430 =7 


of the products of the elements that our fingers encounter. ‘To obtain the ele- 
ment 4 in the first row and second column of the last matrix, we apply the fingers 
to the first row of the first matrix and the second column of the second matrix and 
obtain 


(7) 2-1 + L(—1) + 3-1 = 4. 


To obtain the element 3 in the third row and second column of the third matrix, 
we follow the third row of the first matrix and the second column of the second 
matrix to obtain 


(8) 1-1 + (—1)(—1) + 1-1 = 3. 


Nine such excursions suffice to work out the product of two matrices of order 3. 
Only three such excursions suffice to give the general formula 


Qy2 4&3 Q11%1 + AyoX%e + 413%3 
(9) do, ar aes an% + an% + axs 
431 432 Qs AsiXy + Asexe + a33%3 
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or the special case 


3 1 —2\ /1 —I 
(10) ( —1 1) (2) = ( 2} 
0 1 ~—l/ \3 -1 


If we let 


Ai) 212 33 x1 yı 
(11) A = |an az agp AX = | x2} Y= Yop 
231 432 433 3 y3 


then (9) shows that the whole system of equations 


Q131%1 + 412%? + aix = yı 
(12) a2%1 + a22%2 + a23%3 = Yo 
431%1 + azzx2 + a33x3 = V3 


is equivalent to the single matrix equation 4X = Y. It is standard practice 
to think of X and Y as being vectors having scalar components x1, x2, x3 and yı, 
ye, yz and to think of the matrix 4 as being an operator (or transformer) which 
transforms (or carries or converts) the vector X into the vector Y. In this and 
other contexts, matrices have very many important applications. One who 
wishes an easy exercise can prove that the formula 


âil &i2 43 @11 &21 431 1 0 0 
(13) Qa; Aas 43 i2 Az A302}; = 0 1 0 
231 @32 433 4213 G23 433 00 1 


is valid whenever the rows of the first matrix are the scalar components of three 
orthonormal vectors. The last matrix in (13) is called the identity matrix J 
(of order 3) because 4J = I4 = A whenever J is a square matrix (of order 3). 
If Æ is a square matrix of order n and det 4 = 0, then there is a unique (that is, 
one and only one) matrix B such that 4B = J, where J is the identity matrix 
of order n. This matrix B is such that 4B = BA = I. It is called the inverse 
of 4 and is denoted by 471. If AX = Y and det 4 = 0, then X = AY. If 


det 4 = Q, the matrix 4 cannot have an inverse because 
(14) det (AB) = (det 4)(det B) 


when 4 and B are square matrices of the same order and, moreover, det J = 1. 
One who is really ambitious may attack the famous eigenvalue problem for 
matrices. The problem is to start with a given square matrix Æ and learn about 
the scalars À (the eigenvalues) and the nonzero vectors X (the eigenvectors) for 
which AX = XX. 

14 This problem involves square matrices of order 2; analogous results hold 
for square matrices of greater order. Suppose 


41 = 41141 + Qi2V2 yı = bixi + bixo 
Anyi + aero, yo = beyx1 + doors. 


ta 
w 
il 
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so that z = Ay, y = Bx, and z = A(Bx). Show that z = Cx, where the matrix 
C is the product of 4 and B, that is, C = AB. Remark: The result shows that 
products of square matrices are defined in such a way that 4(Bx) = (4B)x. 

15 A two-by-two matrix of numbers a,j, determines the system of equations 


x’ = anx + ary 
y = anx + ao20y 
which transforms a given point (x,y) into its transform (x’,y’). Let three points 


(91); (£292), (x3,y3) have transforms (x191), (x22), (+333). As an exercise in 
multiplying matrices (or determinants), prove that 


, , 
*1 yı l aixi + ayi axi + azy | 
xo Se lj = Aix 9 + QZyoV2 GAaiXo + A2272 1 
! td 


xi ři l i an an 0 | 
= [Xo Ye 1 Q\2 a22 0 le 
xs ys 11/0 0 ıl 


Remark: With the aid of the results of this problem, we can prove some theorems 
in geometry. Let D = amaz — araz, so that D is the determinant of the matrix 
of the transformer. The area |T’| of the triangular region having vertices at the 
transforms is equal to |D] times the area |T| of the triangular region having ver- 
tices at the original points. The orientation (clockwise or counterclockwise) is 
preserved if D> 0 and is reversed if D<0. If D =Q, the transforms are 
collinear. The transformer conserves areas if and only if |D| = 1, that is, 
D = l or D = —1. If the transformer is isometric (conserves distances), then 
it also conserves areas, and hence |D| = 1. These results can be extended to 
give information about transforms of oriented simplexes having four ordered 
vertices (xr Yr Zk), k = 1, 2, 3, 4, in E3. When 


Xi Yi 7 1 


x z l lil Ayo 4AL 
2 2 2 
D=); , a 1P D = | an ax azl, 
3 3 3 
231 @32 433 
x4 Ya Z4 l 


the simplex is (by definition) positively oriented when D, > 0 and negatively 
oriented when Dı < 0. The transformer conserves volumes if and only if D = 1 
or D = —1, and it preserves orientation if and only if D > 0. 


2.6 Vector products and changes of coordinates in E, Letuandv 
be vectors in E; having scalar components 11, uz, uz and vı, 92, 93 with 
respect to a right-handed x, y, z coordinate system endowed with the 
usual unit vectors i, j, k. Then 


(2.61) u = ui + uj + uK, v = pi + vj + 23K. 
The vector (or cross) product has been defined by the formula 


(2.611) u x v = |u| |v| sin én 
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involving the angle 0 and the unit normal (thumb vector) n of Figure 
2.612. We now work out a convenient formula which gives u X V in 


Figure 2.612 


terms of the scalar components of u and v. Remembering that the 
vector product of two vectors depends upon the order of the factors, we 
shall be very careful. We have 


(ui + uj + usk) X (ori + o2) + oak) 
= ui X (ni + 2j + 03k) 
+ uj X (ni + 2j + 2K) 
+ usk x (oi + 2j + 03k) 


uxXv 


SO 
u xV = upi Xi + uwi X j + uwi x k 
+ umj Xi + uj X j + uaj x k 
+ 301K xi+ u300k xj + 303K x k. 


With the aid of the helpful fact about the vector product of two con- 
secutive vectors in the parade ijkijk, given in (2.234), we obtain the 
unlovely formula 


(2.613) u xv = ifu; — ugve) — j(uivs — ua01) + K(uyoe — uz). 
This looks better when we put it in the form 


Ug U3 
03 


uxv=i +k 


Uy U3 
01 03 


Uy te 
Ji 02 


Our next step is to allow ourselves the liberty of putting vectors into the 
first row of a determinant so we may put this in the form 


ij =k 
(2.62) uUXV=/u Ue U3 
v1 U9 03 


which is very easily remembered. It is the fashion to remember (2.62) 
and to expand the determinant in terms of elements of the first row when- 
ever this is desirable. When we must find the vector product of two 
vectors U and v defined by 


u=i-2%j+3k, v=2i—-j—k, 


all persons except typists and printers are happy to solve the problem 
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neatly by writing 


Hoi kl 
uxv=!1 —2 3[/=5i+ 7j 4 3k. 
2-1 1) 


We should all know enough to be able to guard against computational 
errors by observing that our answer is perpendicular to u because 
5 — 14+ 9 = 0 and is perpendicular to v because 10 — 7 — 3 = Q. 

To exhibit an application of vector products to a problem in geometry, 
we suppose that we are given two orthonormal vectors i’ and j’ in E3 
(this means that i’ and j’ are unit vectors and are orthogonal or per- 
pendicular) and are required to find a third vector k’ such that the three 
vectors i’, j’, k’ constitute a right-handed orthonormal system. The 
answer is given by the formula K’ = i’ xj’. This is so because 


(2.621) xj’ = 


i’| |j’/| sin ôn = n, 


where n is the unit thumb vector, and this is exactly what k’ must be. 
This problem can be put in a different form. Suppose we are given 
a right-handed rectangular x, y, z coordinate system endowed with the 
usual orthonormal vectors i, j, k. Suppose further we know the scalar 
components 411, 412, 413 and doi, 422, a23 Of two orthonormal! vectors i’ 
and j’ so that the coefficients in the first two of the equations 


i’ = ani + aiaj + ai3k 
(2.63) j = Aoi + azə] + do3k 
k’ = azil + azaj + az3K 


are known. The problem is to determine the numbers azı, a32, 433 SO 
that the three vectors i’, j’, k’ will form a right-handed orthonormal 
system. These numbers are determined from the formula 


i j k 
azi + azj + a33k = k’ = i’ x j ={a@u ar ag 
Qo; G22 423 


and the problem is soived or almost solved. To write more formulas is 
somewhat anticlimactic, but we can do it. Equating coefficients of 
i, j, and k gives 

431 = 412423 — 413422 
(2.631) 432 = 413421 — 411493 


433 = @11492 — 4@12Q01 


and then the problem is surely solved. 
Everyone should read the remainder of this section, but teachers who 
want to confine attention to other topics may inform their charges that 
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hasty reading and preliminary ideas will be satisfactory preparation for 
future encounters with the material. 

Here we begin to explore some of the reasons why the system (2.63) 
of equations is important. Suppose we have, as in Figure 2.64, two 


Figure 2.64 


right-handed rectangular coordinate systems in £3. The x, y, z coordi- 
nate system having origin at O and bearing an orthonormal set i, j, k 
of vectors is shown on the left. The x’, y’, 2’ coordinate system having 
origin at O’ and bearing an orthonormal set i’, j’, K’ of vectors is shown on 
the right. Our first task is to study the important systems of equations 


i = ani + arj + ark, = ani + aeij’ + azK’ 
(2.65) j = ani + azj + azK, = ayol’ + azj + azK' 
azi + azj + azk, K = ani + aoaj’ + azK' 


Qis pute 


PT 
l 


that relate the vectors in the two orthonormal sets. We observe a fact 
that can be considered to be remarkable even when we know the reason 
for it: the coefficients in the system of equations obtained by solving 
the first system for i, j, k are easily written down by interchanging the 
rows and columns in the first system. The reason is simple. The 
orthonormality of the vectors implies that, in each system, a is i'i, 
a2 is i'-j, ais is iK, az is j’-i, and so on until, finally, a33 is kk. The 
numerical coefficients in each row (and hence also in each column) in 
the right member of each system are the scalar components of a unit 
vector. The nine coefficients are cosines of direction angles, but we 
carefully avoid attempts to work out formulas by means of figures 
showing the nine angles. 

As soon as we look at the point P of Figure 2.64, we realize that P has 
two sets of coordinates, there being a set x, y, z for the unprimed or old 
coordinate system and another set x’, y’, z’ for the primed or new coordi- 
nate system. If we know enough about the relative positions of the 
two coordinate systems, we should be able to find one set of coordinates 
when we know the other set. We shall solve this problem with the aid 
of vectors. To specify the relation between the two coordinate systems, 
we suppose that, with reference to the unprimed coordinate system, the 
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coordinates of O’ are xo, yo, Zo, so that 


(2.66) OO’ = xi + yoj + zok, 


and that the unit vectors i, j, k and i’, j’, k’ of the two coordinate systems 
are related by the fundamental formulas (2.65). It is now surprisingly 
easy to solve our problem. Let P be a point in E; having unprimed 
coordinates x, y, z and primed coordinates x’, y’, 2’, so that 


(2.661) OP=xityjtek, OP = xi + yj’ + vk’. 
Putting the formula OP = 00'+0'P in the form OP — 00’ = OP 


then gives 
(2.662) (x — xo)i + (y — yoj + (z — zok = xi + yF + ak’. 


An expression giving the right side in terms of i, j, K is obtained by 
multiplying the members of the first three equations in (2.65) by x’, y’, 2’, 
respectively, and adding the results. The coefficients of i, j, and K turn 
out to be, respectively, the right members of the equations 


= , , , 
x — xo = anx + any + azz 
/ / 4 
y — Yo = arx + azy + A322 
/ / ! 
Z — Zo = a% + a23y + a332. 


These equations therefore result from equating the coefficients of i, Jj, 
and k in (2.662). Transposing gives the formulas 


x xo + aux’ + any + azt 
(2.67) y = yo + anx’ + azy’ + asz 


z = Zo + Ay3x! + azy" + A332 


I 


which express the unprimed coordinates of a point in terms of the primed 
coordinates of the point. A very similar procedure gives the formulas 


x = Xo + anx + 4412Y + a13% 
(2.671) y’ Yo + anx + a227 + 4232 
zo + asx + azy + 4332 


n 
I 


which express the primed coordinates of a point in terms of the unprimed 
coordinates of the point. The formulas (2.67) and (2.671) are known 
as the formulas for changes of coordinates. The formulas (2.67) are 
often used to convert an equation involving coordinates x, y, 2 into a 
new equation involving new coordinates x’, y’, z’. As can be expected, 
it is sometimes far from easy to so determine the numbers xo, Yo, Zo 
and ap in (2.67) that the new equation will have the simplest possible 
form. For the present we do not need to know much about these 
matters, but we should know that there are situations in which one 
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particular coordinate system is better than others and that there exist 
formulas relating the coordinates in two different coordinate systems. 

It is sometimes said that fundamental problems in analytic geometry 
are not adequately covered in textbooks that combine the study of 
analytic geometry and calculus. Much more analytic geometry will 
appear later in this textbook. Meanwhile, we consider a fundamental 
problem in analytic geometry that is sometimes ignored in elementary 
geometry books. Suppose we say, with reference to some rectangular 
x, y, 2 coordinate system, that a set S in £3 is a quadric surface if it is 
the set whose points P(«,y,z) satisfy an equation of the form 


(2.68) Ax? + By? + Cz? + Dxy + Exz + Fyz + Gx + Hy + Iz+ J =0, 


where the coefficients 4, B, C, D, E, F are not all zero. Our big question 
is the following. Can it happen that Miss White chooses a particular 
x, y, z coordinate system and finds that a particular set S* is a quadric 
surface because there do exist coefficients 4, B, - - - , E not all 0 such 
that S* is the set of points P(x,y,z) for which 4x? + By? + -+ - =0, 
while, at the same time, Mr. Black chooses another x, y, z coordinate 
system and finds that the same set S* is not a quadric surface because 
for his system the required coefficients do not exist? If the answer is 
affirmative, then the above definition of quadric surface and the above 
set S* should be placed in the museum of the SPC (Society for the Pro- 
motion of Confusion). It can be shown that the answer is negative 
and hence that the definition of quadric surface does make sense. To do 
this, we let x’, y’, 2’ denote the coordinates of Mr. Black and substitute 
the values of x, y, z from (2.67) into (2.68) to find what the equation of 
S* will be in the coordinates of Mr. Black. The critical equation turns 
out to be 


(2.681) Æx? + Bly’? + C's’? + D'x'y’ + E'x's! + Fly's! 
+G'x' + Hy + I'2 + J’ =0, 


where 


A! = Aaj, + Bê, + Cal, + Danas + Eana + Faas, 


and formulas for the other coefficients can be written out. Proof that 
the coefficients 4’, B’, C’, D’, E', F’ are not all zero can be based upon 
the fact that substituting the expression for x’, y’, x’ from (2.671) into 
(2.681) must yield the original equation (2.68). If 4’, B’, C’, D’, E', F’ 
were all zero, this substitution would show that 4, B, C, D, E, F are all 
zero, and this is not so. The principle involved is the following: As we 
see from (2.671), a change from coordinates x, y, z to x’, y’, z! cannot 
increase the degree of a polynomial in x, y, z. Moreover, the change 
cannot decrease the degree because, as we see from (2.67), the change 


from x’, y’, x’ back to x, y, z cannot bring a polynomial of lower degree 
back to the original polynomial. 
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So far as we know, some quadric surfaces may be rather complicated 
things, and it is of interest to know what the intersection S; of a quadric 
surface and a plane r may be. Such a set S; is a quadric section. A little 
thought can save us a lot of trouble. We can introduce a coordinate 
system in such a way that the plane = is the plane having the equation 
z =Q. The equation of the quadric surface must have the form (2.68). 
Putting z = 0 in (2.68) shows that the quadric section must be the set 
of points P(x,y) in the xy plane whose coordinates satisfy the equation 


(2.682) Ax? + By? + Dey + Gx + Hy + J =0. 


Quadric surfaces and quadric sections will be studied later. Meanwhile, 
we make some remarks that may be at least partially understood. The 
family of quadric surfaces includes spheres, circular cylinders, circular 
cones, ellipsoids, various kinds of paraboloids and hyperboloids, and, 
in addition, assorted degenerate things such as empty sets, lines, planes, 
and pairs of planes. The equations z? + 1 = 0, x? + y? = 0, z? = 0, 
and z? — 1 = Odo have the form (2.68). The family of quadric sections 
includes circles, parabolas, ellipses, hyperbolas, and, in addition, such 
degenerate things as the empty set, points, lines, pairs of lines, and the 
whole plane. 


Problems 2.69 


1 Supposing that u and v are vectors in £3 having scalar components 1, 
u2 uz and vı 12 93, tell how u X v can be expressed as a determinant. Ans.: 
(2.62). 

2 Calculate w = u X v and check your answer by showing that w-u = 0 
and w-v = 0 when 


(a) u =2i—3j+ 4k, v=2i+ 3j+ 4k 
(b) u = 2i — 3j + —4k, v=2i+3j+4k 
(c) u=i+j, v= i+j+k 
(d) u =i, v= it j 


3 Two unit vectors u; and uz have their tails at the same point P on a 
sphere which we consider to be the surface of an idealized earth. Suppose that 
P is neither the north pole nor the south pole of the earth, that u, points east, 
and that uz points north. Find the direction of u, X Us. Ans.: Up. 

4 Show that the vectors i’ and j’ defined by 


(i — 2j + 2k) 
(2i — j — 2k) 
ai + bj + ck 
constitute a two-dimensional orthonormal system and then so determine a, b, 


c that the three vectors constitute a right-handed three-dimensional orthonormal 


system. Ans.: k’ = 3(2i + 2j + K). 


i’ 
j s 
k’ 
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5 Show that the vectors 


u, = cos @ sin 6i + sin ġ sin 6j + cos 6k 
U2 = cos ġ cos Oi + sin @ cos Oj — sin 6k 
U3 = —sin ġi + cos fj 


in the order Wj, Uz, U3 constitute a right-handed orthonormal system. 
6 If 
OP; = 2i + 3j + 2k 
OP, = 3i + 2j + k 
OP; = 2i + j+ k, 


find a vector orthogonal to the plane containing Pi, Pa, Pa. Hint: The vector v 
defined by v = P,P x P,P; is an answer, and v = —i +j — 2k. 

7 Check the answer to the preceding problem in the following way. Write 
the equation of the plane through Pı orthogonal to v and then show that Pz and 
P} lie in this plane. 

8 Show that the lines having the equations 


x-1l yt+6 z+ 10 x—6 ytl 2+5 


-e SP a 


iy E 2 =1 4 
intersect. Then find equations of the line perpendicular to both at their point 
of intersection. Solution: The vectors OP and 00 running from the origin to 
points P and Q on the two lines are 
OP = (1+2)i+ (—6 + 2)j + (—10 + 3)k 
OQ = (6+ 2u)i + (—1 — u)j + (—5 — 4u)k, 
where ¢ and u are scalars that depend upon P and Q. Equating these vectors 


shows that they coincide when ¢ = 3, x = —1 and hence that the given lines 
intersect at the point R for which 


OR = 4i — k. 
Thus R is the point (4,0,—1). This result is easily checked. The vector 


i j k 
1 2 3| = —5i + 10j — 5k 
2 1 —4 


is orthogonal to vectors on the given lines and hence the equations 


ne nee T pee 


are equations of the required perpendicular line. 
9 Prove that each vector V satisfies the equation 


ix(v Xi) +j X(v Xj) +k x(v xk) = 2v. 
10 Show that (i Xj) Xj+¥i x (j xj). 
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J1 Prove that if u and v and w are vectors having scalar components 2, 
U2, U3 and 01, Ve, V3 and Wi, We, W3, then 
ij K 
(1) ue(v X w) = (uri + uj + usk)-| 9, 92 vs 
WwW; We W3 
and hence 
Ui U2 U3 
(2) u(y Xw) =| n nl. 
Wı Wo W3 


Remark: The number u-(v X w) is called the scalar triple product of the three 
vectors, and we shall see how it is related to volumes of tetrahedrons and paral- 


lelepipeds. Let v and w be nonzero nonparallel vectors having their tails at a 
point 4 as in Figure 2.691. Then 


(3) v Xw = |v| |w] sin On = 2|T,[n, 


where 8 is the angle between v and w, n is the unit normal determined by the 
right-hand rule, and |T2| is the area of the two dimensional triangle Tz of which 


B 


Figure 2.691 


vand w form two sides. Letu bea third vector which has its tail at_4 and makes 
the angle @ with n. In case0 < ¢ < 2/2, the number u-n is the distance from 
the tip of u to the plane of the vectors v and w. The volume F of the tetrahe- 
dron having base T, and opposite vertex B is therefore given by the formula 


(4) y = $(u-n)|T)I. 
Hence 
(5) V = łu (2T) = łu (v X w). 


The volume of the tetrahedron is half the volume of the pyramid whose vertex 
is B and whose base is the parallelogram of which v and w are two adjacent sides. 
The volume of this pyramid is one-third the volume Vp of the parallelepiped of 
which u, v, w are adjacent edges. Therefore, 


(6) Ve=u-(v X w). 


Thus, when 0 < ¢ < 2/2, the scalar triple product u-(v X w) is the volume of 
the parallelepiped of which u, v, w are three adjacent edges. When ¢ = 4/2, 
the vector u lies in the plane of v and w and the scalar triple product is zero. 
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When r/2 < @ < r, the scalar triple product is the negative of the volume of the 
parallelepiped. 

12 Prove that if i, j, k and i’, j’, K’ are two right-handed orthonormal sets 
of vectors, then the determinant of the coefficients in the system 


ai + aij + ai3k 
ani + azj + aK 
azii + azoj + azk 


must be +1. Outline of solution: The hypotheses imply that j’ X k’ = i’ and 
hence that i’-(j’ X k’) = 1. But 


li 


Pepe 
It 


i j k 
i’-(j’ X k’) = (ani + arj + aisk)-| an an az 


| 231 432 433 


= A, 


where A is the determinant of the coefficients, and the result follows. 

13 [tis clear from geometrical considerations that if i, j, k and i’, j’, K’ are 
two right-handed orthonormal sets of vectors in £3 for which k’ = k, then there 
must be an angle @ such that the “new” vectors i’ and j’ are related to the “old” 
vectors i and j as in Figure 2.692. The new vectors and new x’ axis and new yy 


Figure 2.692 


axis are dashed in the figure, and the vectors k and k’ are not shown. It is easy 
to see, with the aid of the figure, that 


(1) ‘== (cos $)i + (sin $)j 
’ = —(sin o)i + (cos ¢)j. 


Check up on this story by using (1) and vector methods to prove that, whatever 
$ may be, it is actually true that ji’] = 1, |j] = 1, j’ = 0, andi’ xj’ = k. 
Solve the equations (1) for i and j to obtain the formulas 


(2) i = (cos ¢)i’ — (sin ¢)j’ 
j = (sin ¢)i’ + (cos )j’ 
which give the new vectors in terms of the old. Observe that changing the sign 


of @ converts one system of equations into the other. Finally, show that if P 
lies in the plane of Figure 2.692 and if 


ll 


Cio pute 


(3) OP = xi + yj = x’! + yj’, 
then 
(4) OP = x't(cos $)i + (sin $)j] + y’[—(sin $)i + (cos $)j] 


= [x’ cos ġ — y’ sin p]i + [x’ sin @ + y’ cos A]j 
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and therefore 


(5) 


x’ cos ġ — y’ sin d 
x’ sin $ + y’ cos ¢. 


I 


y 


Remark: The formulas (5) are, perhaps unwisely, called “formulas for rotation 
of axes” in E: Actually, they are used to convert equations involving “fold” 
coordinates x, y into new (and sometimes simpler) equations involving new 
coordinates x’, y’. 
14 Supposing that u and v are nonzero noncollinear vectors, show that the 

vector 

(uXv) xu 

(u Xv) X ul 


is a unit vector which lies in the plane of u and v and is orthogonal (or perpen- 
dicular) to u. 

15 Cultivate some useful skills by following instructions and paying particu- 
lar attention to steps that seem to be worthy of notice. Draw vectors PP, and 
PP; and then draw the angle ĝ and the unit normal n that appear in the definition 
of PP: X PPa. _ Show that the area 4 of the parallelogram having adjacent sides 
on PP, and PP; is 

A = |PP,| |PP3| sin 6. 


Draw another vector PP, and show that the distance d from P; to the plane of 
PP, and PP; is 
d= +PPyn, 


where the sign is so taken that d 20. ‘Then, depending upon circumstances, 
remember or learn that V = Ad, where V is the volume of the parallelepiped 
having adjacent edges on PP., PP}, PP; if d ¥ 0 and V = 0 if P; lies in the plane 
of PP, and PP;. Use this to show that 


V = +PP,-(PP, X PP). 


Supposing that P has coordinates x, y, z and that the points Pp have coordinates 
Xk Yk, Zk, Show that 
i j k 
V = +(e — x)i + (yı — yj + (Gi — z)k}| x2 — x ye—y te 
x3 — x Ys y Zz3—z3z 
and hence that 
xı — x yl —y n-z 
V = t|x2—x yo-y z2>=z 
xs — x ys — y 2—-% 
Use this to show that 


x y z 1 x y z |I 

y= p| yy zn—=z O}_ sim y a Hf 
xa — x Ye-y zz—z 0 x2 Yo Z% | 

x3 — KX Y3—- VY 237 @ 0 %3 Y3 43 1 
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16 Prove that a line which is not completely contained in a quadric surface 
can intersect the quadric surface at most twice. Solution: Suppose the coordi- 
nate system is so chosen that the given line has the equations y = z = Q, and 
let the equation of the quadric surface be (2.68). A point (x,0,0) then lies on 
the surface if and only if 4x? + Gx + J =0. If there is an x for which this 
equation is not satisfied, then at least one of 4,G, / must be different from 0 and 
there are at most two values of x for which the equation is satisfied. 

17 The purpose of this long problem is to develop ideas about the transversals 
of three given skew (no two lying in the same plane) lines P,;P2, P3Ps, PsP». A 
line Z is called a transversal of the given lines if it intersects the lines PPa, P3P4, 
P;P,at points Q, Qi, Q2as in Figure 2.693. Foreach & = 1,2, - - - , 6 the coordi- 


P, 
Figure 2.693 


nates (xz,¥%,2%) of Px are given numbers and we want information about the 
coordinates of Q, Qı, Qz. The latter coordinates are determined with the aid of 
numbers A, Ai, Ae for which 


(1) PQ = AP Po, PQ: = AiP3Q4; PQ: = APPs 


Our first step is to select a number A (or to think of à as being “‘fixed”) and ask 
whether a transversal through Q exists. If A is so chosen that the plane m, con- 
taining Q, P3, P4 does not intersect the line PsP. or intersects the line PsP¢ at a 
point Qs for which the line QQ» is parallel to the line P3P4, then no transversal 
through Q exists. Henceforth, we suppose that À is not so unhappily chosen. 
A transversal through Q is then obtained by drawing the line L through Q and 
the point Qz where m intersects the line QsQs. The value of Xz can be calculated 
in terms of À from the equation 


xs + Alx — xs) ys + Alys — ys) zs + A(z — zs) 1 

(2) X3 Y3 33 1 
x4 y4 Za l 

xı + Alx: — x) yit Aly — yı) z + A(z — z) 1 

which says that Qz is the point on the line PsP which lies in the plane r, contain- 


ing Pa Pa, Q. Similarly, the value of \; can be calculated in terms of A from the 
equation 


= 0 


x + Ai (xa — X32) V3 + Ailsa — y3) Zz + A (z4 — z3) 1 

(3) X5 V5 25 1 = 0 
xs ye Z6 1 
xı + A@2— x) yi tAGye— y) zi +XA(ze— z) 1 
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which says that Qı is the point on the line P3P,4 which lies in the plane me con- 
taining Ps, Ps, Q. Thus the required coordinates of Q, Qi, Q2 are determined in 
terms of À. Remark: Study of the set S of points P(x,y,z) that lie upon trans- 
versals can be very interesting. If P lies upon the transversal through Q, then, 
for some scalar u, 


(4) OP = (1 — u)OQ + HOO). 

But 

(5) OQ = (1 — \)OP: + AOP, 

and, since (3) shows that there are constants 4 and B for which A; = 4A + B, 
(6) OO: = (1 ~ [4A + BIOP; + (AN + B)OP,. 

Therefore, 


(7) OP = (1—#)(1—d)OPi+ (1 —u)\OP, + uOP; _. 
— p(Ad + B)OP; + w(Ad + B)OP,. 

Hence there are vectors Vj, Ve, V3, V4 such that 

(8) OP = vi + Ave + vs + Apve 


It follows that there are scalars, a), - - -> , ds such that 


gr + bA + ciu + dip 


(9) y az + bA + cou + dap 


Z = A} + bÀ + C3} + dN. 


It can be shown that the equations (9) are parametric equations of a quadric 
surface. In fact, eliminating \ and u from the equations (9) shows that x, y, z 
must satisfy an equation of the form (2.68). Thus S is a quadric surface, and 
we have a quite straightforward procedure for determining its equation in terms 
of the eighteen given coordinates of the six given points Pi, Pa -:-,P. Stu- 
dents who attain full comprehension of this matter will have passed far beyond 
the minimum requirements of this course, and they can find the experience to 
be both enjoyable and beneficial. 

18 Those who wish to extend acquaintance with matrix theory should copy 
the systems of equations in (2.65) and look at them while reading this. Let U 
and UT denote the matrices of the coefficients (or scalar components or direction 
cosines) of the systems so that 


âil G12 413 âil 21 4&3) 
U = 4221 A22 493 |, UT = Qi2 G22 azo jf. 
431 432 433 A13 G23 433 


The matrix UT is called the transpose (or transposed matrix) of the matrix U, 
and this invites us to realize that UT can be obtained from U by transposing 
(interchanging) the rows and columns of U or by transposing the elements of U 
across its main diagonal. The rows of U are scalar components of orthonormal 
vectors, and the matrix is square. Such matrices are called unitary (or ortho- 
normal) matrices. Therefore, Uis unitary. When U is unitary, an application 
of the rule for multiplying matrices shows that UU? = I, where J is the unit 
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matrix. Therefore U~} = UT. This is important; the inverse of a unitary 
matrix U is UT. 

19 This problem requires us to agree with Miss Garnett that methods of 
analytic geometry can be used to solve a challenging problem that may baffle 
those who seek more elementary solutions. It is supposed that a, b, ¢ are positive 
numbers and that the points 4(0,0), B(c,0), C(a,b), D(a + c, b) are vertices of a 
parallelogram. Let a point E(u,0) on the bottom side of the parallelogram be 
joined to the top vertices C and D and let a point F(t,b) on the top side of the 
parallelogram be joined to the bottom vertices 4 and B. The lines EC and F4 
intersect at Pi, and the lines ED and FB intersect at Pz. The line PiP2 meets 
the side AC at Q; and meets the side BD at Q2. The question then arises whether 
the distance from Q; to 4 is equal to the distance from Qz to D. Elementary 
geometrical considerations show that the answer will be affirmative if the line 
P,P, contains the center P; of the parallelogram, this center P3 being the inter- 
section of the diagonals of the parallelogram. Show that, for each k = 1, 2, 3, 
the elements of the kth row of the determinant 


tu bu i 
t+u—a ittu—a 
ac +c? — tu b(c — u) 

at2ce—t—u atde-t—u 
ate b 
2 2 l 


are xz, Yr, 1, where x, and yx are the coordinates of P}. Then prove that the 
determinant is O and hence that the line PP actually does contain P3. 

20 Let Pi(x1,y1), Po(x2,v2), Ps(xs,y3) be vertices of a triangle such that no two 
of the vertices lie on a line through the origin. Let ^i, As, As be three different 
numbers, and for each k = 1, 2, 3, let Q, be the point (Apxz, Axyx). The two tri- 
angles P;P2P3 and Q:Q203 are then perspective, the center of perspectivity being the 
origin. The lines P,P, and Q:Q2 intersect at a point Rs, the lines PaP and Q-Q; 
intersect at a point R, and the lines PsP; and QQ; intersect at a point Re The 
famous Desargues theorem says that the three points Ri, Ra Rslieona line Z. It 
is easy to sketch figures illustrating the theorem, but proofs are not easily origi- 
nated. Possessors of sufficient time, paper, and technique may cultivate addi- 
tional technique by finding the x coordinate of R3, and then interchange x and y 
and advance subscripts to discover that the coordinates of Re and R3are the first 
two elements of the bottom rows of the determinant in the equation 


x y 1 
(As — 1i — (Ar lAs (As — Dy — Ai — 1)ays 1 
As — Ad As — Ai = 0. 
(A; — 1)A2x2 — (A2 _ DA ix: (A; _ l)Ags — (A2 — LA 1 
Ai — As Ar — Ag 


This is, therefore, the equation of the line L through Rz and R3. Considerable 
courage is required to show that the coordinates of R, satisfy this equation and 
thus obtain an analytic proof of the Desargues theorem. 


Functions, 
limits, 


3 


derivatives 


3.1 Functional notation As we progress in a study of a science, it is 
necessary to become familiar with terminology and notation used for con- 
veying information. One of the most important 

mathematical words is the word function. We J 

may look at Figure 3.11, in which x, y, and z are < N 
the lengths of the sides of a triangle and @ is the 4 x 

angle at the vertex V opposite the side of length z, Figure 3.11 
and say that z is a function of x, y, and @ which we 

shall denote by f(x,y,0). By this we mean that when x, y, and 0 are 
given numbers for which x > 0, y > 0, and 0 < 8 < r, the number z 
is completely determined and has a value which we denote by the symbol 
in the right member of the formula 


(3.12) z = f(x,y,6). 
iii 
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This equation is read “‘z equals f of x and y and 0.” It happens that the 
law of cosines, which involves one of the more important formulas which 
should be learned in trigonometry, gives the formula 


(3.13) f(x,y,0) = a/ x? + y? — 2xy cos 0 


from which we can compute f(x,y,0) when x, y, 0 are given numbers. In 
spite of the fact that numbers do not move, it is sometimes a convenience 
to think of x, y, 6, z as being “variables” and to think of z as being the 
“dependent variable” which is a function of the three “independent 
variables” x, y, 6. 

Many examples are more complicated than this, and we can broaden 
our intellectual horizons by thinking briefly about one of them. It is 
standard practice to write 


(3.14) v = f(x,y,z) 
= filx,y,2,2)i + folx,y,2,2)j + folx,y,2,0)k, 


where V, a vector, is the velocity of a fluid (which might be air) at the place 
having rectangular coordinates x, y, z and at ttme?. We say that v and 
its scalar components are functions of the four variables x, y, z, t. We 
mean that when x, y, z are coordinates of a point in the region being con- 
sidered and when ¢ is a time (measured in specified units from a specified 
zero hour) in the time interval being considered, the velocity v and its 
scalar components at that place and time are completely determined and 
that f(x,y,z,2) denotes the velocity and f:(x,y,z,t), fo(x,y,2,t), f3(x,¥,2,¢) 
denote the scalar components. 

There are two useful and more or less modern ways of attaching mean- 
ings to the symbols f and f, appearing in the above example. One is the 
dynamic approach and the other is the static approach. In the dynamic 
approach, f and f; are regarded as operators or transformers (like machines) 
to which we can feed appropriate ordered sets x, y, z, t of numbers. Then 
(after mechanical squeaking or electronic flashing or what not) f and fı 
produce the required vector f(x,y,z,t) and the required number /;(x,y,z,f). 
In the static approach, f is regarded as being the set of ordered quintuples 
(x, Y, Z2, t, £(x,y,z,t)) of four numbers and a vector in which the allowable 
independent variables come first in the appropriate order and the vector 
f(x,y,2,t) comes last. In this static approach, fı is a set of quintuples of 
numbers. It is a common but not universal practice to consider these 
ideas to be more tangible and useful than the idea that f is a law or rule by 
means of which f(x,y,z,¢) can be calculated when x, y, z, t are given. A 
simpler example may partially clarify these matters. As soon as we know 
that the area y of a circular disk is determined by its radius x (x being 
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positive because radii of disks are positive numbers), we can say that y is a 
function of x and write y = g(x). Then g(2) is the area of a disk of radius 
2 and g(2.03) is the area of a disk of radius 2.03. In each case g(x) = rx’. 
We can think of g as being the operator which converts x into rx? when 
x > 0 or as being the set of ordered pairs (x,rx?) for which x > 0. 

It is important to know about a particular special way in which a scalar 
function of one scalar variable can be determined. Suppose we have a 
given set S of ordered pairs (x,y) of numbers such that the set does not 
contain two pairs (1,71) and (%2,j2) for which x: = x; and yo Æ yı. To 
each number xo that appears as the first number in one of the pairs (x,y), 
there is then one and only one number yo such that the pair (%o0,¥0) 
appears in the set. We may let f(xo) denote 
this number yo, and we have yo = f(%o). Thus 
the given set S of ordered pairs (x,y) becomes 
the set of ordered pairs (x, f(x)). When the 
pairs of numbers in the set are associated with 
points and are plotted in the usual way, an 
example being shown in Figure 3.15, the con- 
dition on the ordered pairs means that no two 
points fall on the same vertical line. In the example of the figure, we 
see that f(x) = 2 when x = 0, that f(x) = 1 when 1 Sx <2, that 
f(x) = 2 when x = 2, and that f(x) =x — 2 when2 <x <3. When 
x has a value different from 0 and not in the interval 1 S x <S 3, no 
meaning has been attached to f(x) and we say that f(x) is undefined. 
In this and other cases, the set of values of x for which f(x) is defined is 
called the domain of the function, and the set of values attained by f(x) 
is called the range of the function. All this is perfectly explicit and pre- 
cise, and it should be thoroughly understood by everyone. One who 
wishes to regard f as an operator must realize that each set S of the type 
described above completely determines the number f(x) that f must 
produce when it operates upon a given number x in the domain of f. 
Likewise, one who wishes to regard f as a set S must realize that the opera- 
tor f determines his set S of pairs (x, f(x)). Everyone must realize that a 
set S* of points in a plane endowed with an x, y coordinate system deter- 
mines both the operator f and the set S, provided no two points of S* 
lie on the same vertical line. As sometimes happens in mathematics and 
elsewhere, we have a situation in which different individuals can hold 
different personal preferences. For example, a person who wishes to 
regard f as an operator can take a dim view of the idea that an appropriate 
set S of ordered pairs of numbers “is” a function because it determines a 
function. He can feel that this is too much like saying that a social 
security number “‘is” a worker because it determines a worker, and he can 
object to the idea that social security numbers eat mashed potatoes. 


2 3 x 
Figure 3.15 
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The contraption in the central part of Figure 3.151 is guaranteed to 
make nearly everybody imagine a more or less complicated process by 
which f might operate upon a given input x (an element of the domain of 
f) to produce the corresponding output y (an element of the range of f). 
The last problem of this section provides ideas about functions, operators, 
and transformers that are needed in advanced mathematics and are help- 
ful in elementary mathematics. 


Figure 3.151 


If we know that y is always positive and that x and y are always related 
by the formula x? + y? = 9, we can discover that y = +/9 — x? when 
—3 <x <3. Thus y is determined as a function of x which is defined 
over the interval —3 < x < 3, and the graph is as shown in Figure 3.16. 
Similarly, if we know that y is negative and x? + y? = 9, we can conclude 


that y = —+/9 — x? and we have a function whose graph appears in 
y y 
1 . 
O x 


Figure 3.16 Figure 3.161 Figure 3.162 


Figure 3.161. If we know that x? + y? = 9 but do not know whether y 
is positive œ negative, we cannot determine y in terms of x. The best 
we can do is say that, for each x in the interval —3 < x < 3, y is one or 
the other of 1/9 — x? and — 1/9 — x2. Figure 3.162 shows the graph 
of a function f for which x? + [f(«)]? = 1, it being true that f(x) > 0 for 
some values of x and f(x) < 0 for other values of x. Observe that the 
equation x? + y? = 1 does not, by itself, determine y as a function of x, 
but that there do exist functions f for which x? + [f(x)]? = 1. 

One purpose of all this discussion is to emphasize the fact that our ideas 
about functions must be both broad and precise. We must remain calm 


3.1 Functional notation 115 


when someone says that the temperature u at the north pole of our earth 
is a function of the time ¢ and, without bothering to introduce a new 
letter whose significance must be remembered, uses the symbol x(t) to 
denote the temperature at time t. Many 

problems in pure and applied mathematics 

involve functions about which we have some 

information and seek more. Moreover, we 

must allow ourselves freedom to use standard 

terminology that everyone else uses to convey a %% b x 
ideas and information. We say that a func- 
tion f is increasing over an intervala Sx < b 
if, as Figure 3.163 indicates, f(%1) < f(x) whenever a S xı < xo S b. 
Similarly, f is decreasing over the interval if 


(3.164) f(x) > f(x) (asx <x: <b). 


Figure 3.163 


In this displayed statement, the “whenever” is omitted. The line can be 
read “‘f(x%1) > f(xe) whenever a S xı < x: S b.” If, as Figure 3.163 
indicates, f is increasing over the interval a < x S b and if f(a) = 4 and 
f(b) = B, we say that f(x) increases from 4 to B as x increases from a to b. 
While we use this convenient terminology, we need not be gullible people 
who are easily persuaded that numbers x and f(x) can actually increase. 
To see 6 increase and say hello to 7 as it proceeds toward 8 could be quite 
amusing, but we make no pretense that such things actually happen. To 
avoid misunderstandings, the author wishes to publicly proclaim that 
he is not recommending rejection of the good old terminology; he is 
merely insisting that we know what we mean when we say that y or f(x) 
increases as x increases from a to b. 

Problem 15 at the end of this section deals with a famous number- 
theoretic function. From some points of view, a perfect definition of this 
function can be phrased as follows. Let r be the function whose domain 
is the set of real numbers and which is such that, for each x in the domain, 
a(x) is the number of primes less than or equal tox. This makes the “law” 
or “rule”? concept sound very good. We can easily make the pretense 
that a sufficiently dynamic operator could produce the numbers w(x) that 
we need to form the set S of pairs (x, m(x)) needed for the static concept. 
It will be observed that, in Problem 15, the function is defined in fewer 
words. 

Trigonometric functions and polynomials are simpler examples of func- 
tions that are important in advanced as well as in elementary science. 
A polynomial (or polynomial in x) is a function P having values defined by 


(3.17) P(x) = aox? + aye?! + ++ + + anat + an 
or by 
(3.171) P(x) 


bo + bix + bax? + + + + + bas”, 
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where n and ao, 41, * * * , An and bo, b1, > ° * , bn are constants, n being 
a nonnegative integer. A rational (ratio-nal) function is a quotient of 
polynomials, an example being the function Q for which 


x+1 


(3.172) Q(x) = 2 4+x—12 


for each x for which the denominator is different from zero. When we 
define a function by a formula more or less like (3.172), we ordinarily 
understand that the domain of the function is the whole set of numbers x 
for which the formula actually determines a number. We must, how- 
ever, recognize the fact that the function g for which g(x) = a/x when 
1 < x < 4 is different from the function A for which h(x) = a/x* when 
x = 0; the domains of the functions are different, and the functions are 
therefore different. To clarify this point, we can recognize that a 
machine which is capable of cracking only medium-sized nuts is different 
from a machine that is capable of cracking nuts of all sizes. 

If we have a load of coal of weight w and we toss a lump of coal on or 
off the load, then the new weight will be a new number which we can call 
w + Aw. Thus Aw, which may be either positive or negative, is the 
difference of two weights (the new minus the old). The number Aw, read 
“delta w,” is a single number (not the product of two numbers A and w). 
This simple notational device turns out to be unexpectedly convenient. 
In physical chemistry Ap is the difference of two pressures, Av is the dif- 
ference of two volumes, and At is the difference of two times. In physics, 
Av is the difference of two (vector) velocities, and AV is the difference of 
two potentials. In economics AP is the difference of two prices, and in 


y+ Ay 
Z+ Az 
6+A8 
Vo y L) 
x x+ Ax 
Figure 3.181 Figure 3.182 


biology AP may be the difference of two populations. If, as in the 
discussion involving Figure 3.181, the left member of the formula 


(3.183) z = f(x,y,8) 


represents the length of the side opposite the vertex V, then the left 
member of the formula 


(3.184) z + Az = f(x + Ax, y + Ay, 0 + 40) 


represents the length of the side opposite the vertex V in the triangle of 
Figure 3.182. Of course we can and sometimes shall use shorter symbols 
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such as A, k, p, q for Ax, Ay, A9, Az, but very often the extra labor involved 
in writing the more elaborate delta symbols is a small price to pay for the 
elimination of the superfluous symbols whose meanings may be forgotten 
and confused. 

At the conclusion of the text of this section, the author makes some 
remarks that he would have made at the beginning if he had thought that 
they could have been understood. The old word “function” has been 
and is and will be used in many different ways. Students who get around 
will have serious difficulties unless they are so well informed and tolerant 
that they can accumulate and dispense information by reading and hear- 
ing and talking quite different languages. It is like being able to play 
football with those who play football and to play basketball with those 
who play basketball; one who knows only ping-pong is sometimes handi- 
capped. This, of course, does not imply that a particular teacher is 
required to stand by while many different games are played simultane- 
ously in his classroom. Each individual teacher may, with the full back- 
ing of the author, go as far as he likes in prescribing the rules of the game 
to be played in his own classroom. 


Problems 3.19 
1 If 
fe) =x, ge) =x, A) = 27, Oe) =] z TI 


verify the following assertions and replace the question marks by appropriate 
answers. 


(a) fO) = 0, f(—3) = —3, f(2) = 

(b) g(0) = 0, g(—2) = 4, g(5) = 

(c) A(—1) = %, A(O0) = 1, A(2) = 4, h(s) = 1.4142 
(d) h(5) = ?, hA(—2) = ?, A(—2) =? 

(e) $È) = $, $2) = 2, 6(—2) = ?, (č) =? 
(f) (8) — f(5) = 3, f(2.1) — f(2) = 

(g) g(3) — g(2) = 5, g(2.1) — g(2) = 

(h) h(3) — h(2) = 4, A(1) — 20) =! 

(i) (1.1) — (1) = —.0475, (0.2) — $0) = 
f(4. ie — 1, £2: D fe. 7) _ > 


(7) 
(h) cs) -w = 8.1, P aE) —? 
0) (2. D- — $(2) _ —.152, s0. 2- (0) _ > 


m) EED 405) — ee) _, 
0.5 ° 


2x + 2, È 


(a) g(l + Ae) — g(1) Ax, g(s) — g(0) _ > 


=2+ 
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( ETAD AA _ p BEHAD =H) oe pay 
de + Ax) — d(x) 2x Ae 
$) Ax ~ +2 + & Fae 


2 The signum function having values sgn x (read signum x, almost like sine x) 
is defined by the formula 


senx= |l (x > 0) 
ssnx= 0 (x = 0) 
sgn x = —l (x < 0). 


Show that Figure 3.191 displays the graph of sgn x and 
then draw the graph of sgn (x — 2). Show that |x| = x 
sgn x. Hint: Consider separately the cases in which 


Figure 3.191 x >0,x =0,andx <0. 
3 The Heaviside (1850-1925) unit function having 
y values defined by 
1 H(x) = 1 (x > 0) 
H(x) = 4 (x = 0) 
H(x) = 0 (x < 0) 


is named for the mighty electrical engineer who popu- 
x larized its use. Show that Figure 3.192 displays the 


Figure 3.192 graph of H(x) and then draw the graph of H(x — 2). 
i Show that 
y A(x) = it sen, sen x = 2H(x) — 1. 
1 e 


4 We need more evidence that not all functions 
have simple graphs that are easily drawn. Let D be 
the dizzy dancer function, defined over the interval 

Ol ees 0 = x < 1, for which 


. D(x) = 0 (x irrational) 
Figure 3.193 D(x) = 1 (x rational). 


Think about this matter and acquire the ability to make a figure more or less 
like Figure 3.193 to “represent” the graph of D. 
5 A function g is defined by the formulas 


g(x) = x? O <x!) 
g(x) =x (otherwise). 
Plot its graph. 

6 <A function f is said to be an even function if f(—x) = f(x) whenever x 
belongs to the domain of f and is said to be an odd function if f(—x) = —f(x) 
whenever x belongs to the domain of f. Prove that the polynomial having values 
2 — 3x? + 5x4 (with only even exponents appearing) is even. Prove that the 
polynomial having values x — 7x* + 2x7 (with only odd exponents appearing) 


is odd. Prove that the polynomial having values 1 — 2x + 3x? is neither even 
nor odd. 
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7 If h(x) = x + 1/x when x Æ 0, show that h(1/s) = k(t) when ¢ ¥ 0 and 
that [h(x)]? = h(x?) + 2. Work out a formula for A(A(x)) and check the formula 
by setting x = 2. 

8 If f(x) =x? + 3x+1, show that f(—3) = 1, f(—-1) = —1, f(0) = 1, 
A) = Y, f(2) = 11, and 


f(x + Ax) = x? + 3x +14 (2x + 3) Ax + Ax? 


when Ax? means (Ax)”. It is quite appropriate to use this formula as a basis 

for a feeling that, when x has a particular fixed value such as 0 or —2 or r, the 

value of f(x + Ax) is nearly the same as the value of f(x) whenever Ax is nearly 0. 
9 If f(x) = mx + b, show that 


f(x2) — fw) _ ,, 


x2 — Xi 


whenever x: = xı. Sketch a figure and comment upon the result. 
10 If f(x) = x?, show that 


feth -Ie 54, 
h 


when, as always when we make calculations of this kind, k #0. Sketch a 
graph of the function and use the above formula to find the slope of the line L 
passing through the two points on the graph for which x = 1 and x = 1.001. 
The answer is 2.001, and it is quite appropriate to have a feeling that this is 
nearly the slope of the line tangent to the graph at the point (1,1). 

11 If f(x) = x?, simplify 


fœ + Ax) + f — Ax) — 2f(), 
Ax? 


12 If f(x) = 1/x, and if x and x + Ax are both different from 0, show that 


f(x tAx)—-f)_ O ol i 
Ax x(x + Ax) 


13 Make appropriate use of the trigonometric formulas 


sin (a + 6) = sin & cos 8 + cosa sin 8 
cos (a + 8) = cos acos B — sina sin B 


to obtain the formulas 


sin (x + h) — sin x _ sinh i, —!: — cosh... 

h h h 
cos (x +h) — cosx sink . _l— cosh 
— s z sins — ——>—— cosx. 


14 Show that y will be a function of x for which 


x? + xy(x) + fy(x)}? = 3 
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if —2 S x S 2 and, for each such x, y(x) is one or the other of the two numbers 


=x — V3(4 — x?) =x + VB — x?) 
g 2 2 | 


which are equal only when x = —2 and when x = 2. Hint: Use the quadratic 
formula. 

15 An integer n greater than 1 is said to be composite if, like 39, it is repre- 
sentable as the product of two integers each greater than 1 and is said to be a 
prime if, like 29, it is not so representable. ‘The primes are 2, 3, 5, 7, 11, 13, 17, 
19, 23, 29, - - - , there being an infinite set of them. One of the famous func- 
tions of number theory is t(x), the number of primes less than or equal to x. 
It is easy to see that (8.27) = 4. It has been proved that (10%) = 168, 
(10°) = 78,498, and 2(10°) = 50,847,478. To graph w(x) over the whole inter- 
val O S x < 10° would be quite a chore. However, draw the graph over a 
shorter interval, say 0 < x < 40, and try to pick up some ideas. 

16 Another famous number-theoretic function has, for each positive integer 
n, the value d(n), where d(n) is the number of positive integer divisors (including 
l and n) of n. For example, the divisors of 6 are 1, 2, 3, and 6. Verify the 
entries in the little table 


11 12 1 


n=123456789 10 11 12 13 
dim) =122324243 4 2 6 2 
and calculate d(233752). 

17 We take a brief preliminary look at some functions that play fundamental 
“ roles in physics, mechanics, and statistics. Let n bea positive integer. For each 
k =1, 2, 3, -> , n, let a particle PX of mass mą be concentrated at the point 
Px(xz,yx)- In what follows we use £ (xi, the Greek x) to denote a number which 
can easily be considered to be the x coordinate of a point, and we use M with a 
superscript to make us think of a moment. For each number &, the number 


M2, defined by 
(1) MY, = my(x1 — $) + m(x2 — E) + > >> + malta — $) 


is called the first moment of the mass system about the line having the equation 
x = & Supposing that the total mass 


(2) M = mtm- Hm 


of the system is positive, we can put (1) in the form 


1) mıxı + mexa H °° © Mtn _ . 
©) M2 = M(™ M ein — £) 


Similarly, for each number 7 (eta) the number M{2, defined by 
(4) My, = myi — 2) + mya — 9) + + > + Malya — n) 


is called the first moment of the mass system about the line having the equation 
y =n, and 


miyi + 7° nyn 
(5) Mp, = M (nteni Emy _ 5), 
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The particular point (£7) for which M®, = 0 and MP, = 0 is called the cen- 
troidt (thing like a center) of the mass system. The coordinates of the centroid 
are denoted by ž and ý. Thus, M®; = 0 and Mp, = Q0, and it follows from (3) 


and (5) that 


= _ mxi + mx: + ++ + + Matn Ga MY t my:t >te + may 
(6) z= MMT mote Tt mets, ga , 
In case m, = 1 for each k, (2) shows that M = n and the formulas (6) reduce to 
(7) ge Meet ten pa Eyt ty 
n n 


In this case the centroid (#,%) is called the centroid of the set of points Pi, Pa, 
- + t, Pa. To prepare us for Section 4.7 and other sections where less simple 
mass systems are considered, we should take brief cognizance of a more general 
definition. Let p be a nonnegative integer. The number Af{?), is defined by 


zæ 
(8) MP, = m(x — E)? + m(x: — EP + + > + + mlan — §)P 


and is called the pth moment of the mass system about the line having the equa- 
tion x = $. Similarly, the number M‘?, defined by 


(9) MR, = my — 9)? + my2 — 9)? +++ + mlyna — 1)? 


is called the pth moment of the mass system about the line having the equation 
y =n. In physics and mechanics (but not so often in statistics) the second 
moment is called moment of inertia. Since we are studying functions, we can 
observe that, if our mass system contains 40 particles, there is a sense in which 
the moments in (8) and (9) are functions of 82 variables of which two are p and &. 
While this textbook does not require calculations of these moments, we can recog- 
nize that there are many situations in which calculations must be made, and this 
is one of the reasons why the world contains so many calculating machines and 
computers of assorted mechanical and electronic varieties. 


18 Iff(x)=1 +x +x? + x + x4, show that f(1) = 5 and 


when x ¥ 1. 
19 Remark: This remark invites more complete comprehension of ideas and 
terminology involving functions. The left-hand part of Figure 3.194 represents 


Back bor L poy amg 


Figure 3.194 


a set D of numbers or of vectors or of entities of some other kind that is called a 
domain. The central part of the figure represents a mechanism that is sometimes 


t We are being rather unrealistic if we suppose that everybody always chooses the same 
coordinate system when studying a given system of particles. The coordinates of the cen- 
troid depend upon the coordinate system used, but (we omit the proof) the location of the 
centroid relative to the system of particles is the same for all coordinate systems. For 
example, if three particles of equal mass lie at the vertices of a triangle, then the centroid 
lies at the intersection of the medians of the triangle. 
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called a black box and is sometimes called a transformer T. When an element x 
of the domain is selected and fed into the black box or transformer, the x is called 
an input and the black box or transformer is supposed to produce an output 
which is an element y of a set R which is called a range. Thus to each x in D 
there corresponds exactly one y in R which is called the transform of x and is 
denoted by T(x) so that y = T(x). Thus we have a transformer T which trans- 
forms each x in D into a transform T(x) in R. So far we have used the words 
“transformer” and “transform,” but we have not used the word “transformation.” 
Our domain and transformer and range determine and are determined by the 
set S of ordered pairs (x,y) for which x is an element of D and y is the element of 
R for which y = T(x), and we call this set S a transformation Ts. The domain 
(set of inputs) and range (set of outputs) of the transformer T are also the domain 
and range of the transformation Ts. We now have adequate terminology and 
notation. The transformer T is the active “operator” that converts each ele- 
ment x of D (or each first element of one of the pairs in Ts) into the transform 
T(x) in R (which is the appropriate second element of a pair in the set S which 
constitutes Ts). The transformer T and the transformation Ts are inherently 
different things, and there can be no doubt that our science is inadequately 
developed when we apply the same name and the same symbol to the two things. 
The worst of it is that, when the word ‘‘function”’ is used, this one word sometimes 
means a transform T(x), sometimes means the transformer T, and sometimes 
means the transformation Ts. Perhaps an assertion involving the word “func- 
tion” will help us to see why we must make a rather serious study of terminology 
before we can be intelligent readers and listeners. It is the function (see the 
nonmathematical meanings given in a dictionary) of a function (transformer) 
to carry an element of the domain D of the function (transformer or transforma- 
tion) into the function (transform) in the range R of the function (transformer 
or transformation). Commenting upon this matter from the point of view of 
mathematical logic, Professor Rosser remarked to the author that some of our 
terminological difficulties are due to the fact that the already overburdened 
old word ‘“‘function” was used as a name for the set S of ordered pairs. It is 
possible that terminology will slowly improve, but meanwhile we can be com- 
forted by the fact that the bad terminology rarely if ever actually injures us. 
We can be irked by the fact that a “diameter” of a circle is sometimes a line 
segment (a point set) and is sometimes a number (the length of the line segment), 


but we are rarely if ever injured and there seems to be no overwhelming demand 
for improvement of the terminology. 


3.2 Limits When we were infants learning to walk and to talk, and 
perhaps even after that, we heard many statements that we could not 
comprehend. When an explorer tells us that he found a complete set of 
normalized Legendre polynomials in an ancient cave in Peru and that the 
carbon test shows that the set is 24,500 years old, it may be difficult for us 
to learn what he is talking about and whether he is telling a truth. 
Moreover, statements involving erudite technical terminology are not the 
only ones that can be troublesome. Sometimes we must do considerable 
working and thinking before we can fully understand statements that 


3.2 Limits 123 


involve only simple words and may seem, at first sight, to be childishly 
simple. 

It is reasonable to suppose that the harangue of the previous paragraph 
is leading up to something, and that the lightning is about to strike. It 
is. We are going to undertake to make a sane appraisal of the assertion 


(3.21) x? is near 9 whenever x is near 3 but x ¥ 3 


which we shall call the assertion in the first box. The assertion does not 
say anything about the value of x? when x = 3. It does not say that 
x? is 9 when x = 3, and hence it does not pretend to tell the whole truth. 
There is a fundamental reason why it is not completely easy to tell what 
the assertion does mean. The reason is that it simply does not make 
precise mathematical sense to say that a number x is near 3. Whether 
416 or 4 or 3.01 or 3.00001 or 2.98 is considered to be near 3 or not can be a 
matter of opinion and can depend upon circumstances. Likewise, it 
does not make precise mathematical sense to say that x? is near 9. 
Discouraging as this may be, we must recognize that it may be possible 
to attach a precise meaning to the assertion in the first box without attach- 
ing meanings to the “assertions” x is near 3 and x? is near 9. After all, 
the word “attaching’’? can mean something even when “atta”? and 
“ching” donot. It should be possible to tell precisely what the assertion 
does mean, because the assertion uses words in a thoroughly serious 
attempt to convey information. A fundamental idea is involved. 

Our first attempt to make sense out of (3.21) is to replace it by the 
assertion 


23 . . j 
(3.211) x? is a good approximation to 9 whenever x is 


a good approximation to 3 but x # 3 


in the second box. This change in the wording can be psychologically 
satisfying, and we started with (3.21) only because it is shorter than 
(3.211). We have not, however, conquered our fundamental difficulty, 
because the statement that one number is a good approximation to 
another is neither more nor less illuminating than the statement that one 
is near the other. 

It is a remarkable-fact that much of the mathematical progress of the 
past century is based upon the development and use of a particular special 
method of attaching meaning to the statements in the first two boxes. 
The method is called the epsilon-delta method because it traditionally 
employs the two Greek letters e (epsilon) and ô (delta). The meaning of 
the assertions in the first two boxes is, by this method, defined to be the 
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same as that of the assertion 


To each positive number e there corresponds 


(3.22) a positive number ô such that 
lx? — 9| < e whenever 0 < |x — 3| < 6 


which we shall call the epsilon-delta assertion.f When we first see the 
epsilon-delta assertion, we are entitled to feel that it lacks the intuitional 
appeal of the preceding assertions, but it turns out to be the fully mean- 
ingful assertion which can be proved if it is true and can be disproved if it 
is false. 

Before further discussion of the assertions in the boxes, we can note 
that they are so long that it is tedious to write them very often and that 
they are universally abbreviated by the efficient and effective shorthand 


Ud 


(3.23) 


in our fourth and final box. Thus the assertions in the four bozes are 
equivalent; if one is true, then all four are true; and if one is false, then all 
four are false. They all mean the same thing. 

The only possible discordant phrase in the symphony is the noise we 
make when we read the assertion in the last box. We say that the limit 
as x approaches 3 of x2 is 9. Thus we have another technical statement 
couched in terms of the dubious concept of moving numbers. Stephen 
Leacock (1869-1944) was wise enough to realize that if a number x really 
could approach 3 from more than one direction, then it should be able to 
reverse the process and go away from 3 in more than one direction. In 
any case, Leacock enabled Lord Ronald (a character in “Nonsense Novels, 
Gertrude the Governess: or, Simple Seventeen”) to fling himself upon his 
horse and ride madly off in all directions. We make no attempt to explain 
our basic concept in terms of moving numbers. Such attempts are much 
too mystic and vague for advanced technical books, and we can hold the 
view that they are at least a little bit.too mystic and vague for elementary 
books. In our book, the collection of words “the limit as x approaches 
3 of x? is 9” does not suggest that numbers jump around; it suggests that 
“x? is near 9 whenever x is near 3 but x is different from 3,” and this 
basic concept is made precise by the epsilon-delta assertion. 


f In this and similar assertions, we avoid difficulties by using the word “‘each” in prefer- 
ence to “any” because the troublesome word “any” often means “‘at least one.” Residents 
of Los Angeles can be expected to give lusty affirmative answers when asked whether any 
major city of the United States lies west of the Mississippi River. Ife < eand |x? — 9| < 
€,, then |x? — 9| < e. For this reason the epsilon-delta assertion will be true if to each e for 
which 0 < e < 0.001 there corresponds a positive number ô such that |x? — 9| < € 
whenever 0 < |x — 3| < 8. 
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It is both easy and customary to adopt the absurd view that everybody 
has spent huge amounts of time squaring all sorts of numbers near 3 and 
has somehow picked up positive knowledge that the assertions in the 
boxes are true. Instead of trying to discover how uncertain we should be, 
we eliminate uncertainties by proving the epsilon-delta assertion. 
Lete> 0. If 0 <6 < land 6 <é/7, then 


(3.24) |x? — 9] = |æ + 3) — 3)| 
= |x + 3| |x — 3| < 7lx — 3| < 75 < e 


whenever 0 < |x ~ 3| <6. To obtain the inequality |x + 3| <7 
which was used in (3.24), we can use an appropriate figure or, alterna- 
tively, use the fact that if |x — 3| < 6 and 6 < 1, then 


ln + 3| =|x --3 +6 S| -—3/ 46-546 <7. 


Thus an appropriate 6 can be found and the assertion is true. It is not 
inappropriate to think about this matter for a few minutes or perhaps 
longer. f 
We are familiar with the nature of the graph 

of the equation y = x, and it is comforting to  9+e 
see that the epsilon-delta assertion has a simple 
geometric interpretation. It says that if e > 0 
and if horizontal lines are drawn through the 
points with y coordinates 9 — e and 9+ e asin 
Figure 3.241, then there exist vertical lines 
(dotted in the figure) such that, with the possible 
exception of a single point for which x = 3, the 
part of the graph between the vertical lines is also 
between the horizontal lines. The little sister we 


=m soen un GUD ue QED mn AEE OED OED TEP Ome an oe ow 


mention occasionally might be irked by the pos- 2 
sible exceptional point, but she certainly would 3-5 3+ô 
be clever enough to put in the dotted lines after Figure 3.241 


we had shown her a figure containing the hori- 

zontal lines; the process is thoroughly elementary and we need not 
require that efforts be made to seek the greatest ô that serves the pur- 
pose. Even though it does not make precise mathematical sense to say 


t The famous flea assertion “each flea has a smaller flea to bite him” is, in some respects, 
similar to the epsilon-delta assertion. We recognize that the “each flea” at the front of the 
assertion invites us to think about fleas one at a time, not every flea or all fleas atonce. To 
prove the flea assertion, we would be required to start with a given flea, say Mr. F. (who 
could be any flea but would not be every flea or the collection of all fleas), and show that 
there is a smaller flea which is so related to Mr. F. (and which may be said to correspond 
to Mr. F.) that it bites him. To prove the epsilon-delta assertion, it suffices to start with a 
given positive number e (which could be 416 or 4 or 0.00001 or any other positive number 
but naturally cannot be all of these things at once) and then show that there is a positive 
number ô so related to e that |x? — 9| < e whenever x = 3 and |x — 3| < ô. 
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that e is small, we need not deny ourselves the satisfaction of the feeling 
that, when the given e is small, the dotted lines must be close together 
and the 6 must be small. 

For the case in which f(x) = x? and a = 3, we have been discussing 
questions involving values of f(x) when x is near a. Our serious interest 
often lies in such questions when f(x) has a more complicated expression, 
say one of 

1 V2 +x — v2 sin x 
x x nr ae 


(1 + x)", 


We should therefore know that the assertions in the four boxes 


(3.25) f(x) is near L whenever x is near a but x ¥ a. 


(3.251) f(x) is a good approximation to L whenever 
x is a good approximation to a but x ¥ a. 


(3.26) To each e > 0 there corresponds a ô > O such that 


\f(x) — L| < e whenever 0 < |x — al < ô. 


have identical meanings. When we have plenty of time, we can always 
replace the epsilon-delta assertion by the following more ponderous but 
psychologically satisfying one. To each positive number e there corre- 
sponds a positive number ô such that f(x) approximates L so closely that 
If) — L| < whenever x is different from a but approximates a so 
closely that |x — a| < ô The assertion (3.27) is read “the limit as x 
approaches a of f(x) is L.” 


If f and a are such that there is no L for which the four assertions are 
true, then we say that 


lim f(x) 


does not exist. Complete comprehension of this matter is essential: 
otherwise, we must be eternally confused by a statement that a thing at 
which we are looking does not exist. 

Some assertions involving limits are not completely simple. ‘There 


will come a day when we must know there is a number e, having the 
approximate value in 


(3.271) e = 2.71828 18284 59045, 
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such that 


(3.272) lim (1 + x)? = e. 
2—0 


Anybody can collect a little evidence in support of this assertion by mak- 
ing calculations when x has such values as +4, +4, +4, and +4, but 
it is not so easy to prove the assertion. In fact we must have very sub- 
stantial information about limits before we can, in Chapter 9, define 
functions having values e7 and, when x > 0, logx. Meanwhile, many of 
the problems that confront us will be solved very quickly and easily with 
the aid of the following fundamental theorems. We call them limit 
theorems, but they are nothing but basic theorems in the theory of 
approximation. 


Theorem 3.281 If 
lim f(x) = Ly, lim f(x) = Le, 


then Le = Lı. 
Theorem 3.282 If bis a constant, then 
lim b = b. 
Theorem 3.283 
lim x = a. 


Theorem 3.284 If ¢ 1s a constant, then 
lim cf(x) = c lim f(x) 


provided the limit on the right exists. 
Theorem 3.285 The formulas 


lim [f(2) + g(2)] = lim fe) + lim g(a) 
lim [f(@)g(@)] = (lim folim g 


za 


, fe) _ RI 


moa g(x) lim g(x) 


are valid provided the limits on the right exist and, in the case of the last 
formula, lim g(x) = 0. 
zZ—a 
Theorem 3.286 If 
lim f(x) =L 
then 
lim |f) — L| = 0 
2a 


and conversely. 
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Theorem 3.287 (sandwich theorem or flyswatter theorem) Zf for 
some positive number p 


g(x) S f(x) S h(x) 
whena — p <x <aandwhena< x < a+ p, and if 


lim g(x) = L, lim A(x) = L, 


wa 


then 
lim f(x) = L. 


Theorem 3.288 If pis a constant positive exponent, thent the first of 
the formulas 


lim x? = a?, lim x? = a? 
ta z—-a+ 


holds when a > Q and the second holds whena = 0. 

These theorems are easily understood and will turn out to be very use- 
ful. Unless his teacher rules otherwise, each individual student has three 
options. He can claim that the theorems are so obvious that they do not 
need proof and, even though this is surely a precarious way to start a 
successful mathematical career, he may even be right. He can claim that 
they are not obviously true but he will accept them because they are 
printed and the teacher says there are no misprints. Finally, he may 
want to see proof because he is suspicious or inquisitive or wants to 
develop abilities to prove things. In the latter case he may attack 
Appendix I at the end of this book. Whatever we do, we should always 
believe that if f(x) lies between g(x) and h(x) and if g(x) and A(x) are both 
near L whenever x is near a but x = a, then f(x) must be near L whenever 
xis near a but x #a. This is what the sandwich theorem says, and the 
meanings of the other theorems are also simple. 

The first two of the following problems are designed to promote under- 
standing of the epsilon-delta assertion (3.26). Wemust always remember 
that if the epsilon-delta assertion is true, then to each (not all or every) 
epsilon that is positive there corresponds a delta that is positive such that 
\ f(x) — L| < e whenever x is different from a but so near a that |x — al < ô. 
It is not asserted that there is a delta which corresponds to every epsilon. 


It is asserted that to each epsilon there corresponds a delta. The epsilon 
comes first, and the delta follows. 


t The meaning of the second of these statements is explained in Section 3.3. The 


theorem is, as Appendix 1 says, proved in Chapter 9 after the theory of exponentials and 
logarithms has been developed. See Theorem 9.271. 
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Problems 3.29 


1 It is not enough to be able to read the four assertions which involve f(x) 
when x is near a but x # a. We must be able to write them. Try to write 
them with the text out of sight and, if unsuccessful, read the text some more and 
try again. 

2 In terms of epsilons and deltas, write a complete statement giving the 
exact meaning of each of the following true statements: 


(a) x? is near 27 whenever x is near 3 but x # 3. 

(b) sin x is near 0 whenever x is near 0 and x # 0. 
sin x 

c 

o © 


(d) i= S08 is near 0 whenever x is near 0 and x ~ 0. 


is near 1 whenever x is near 0 and x = 0. 


(e) x is near 1 whenever x is near l and x # 1. 


(P) tim $ -1 (g) lim LZS Lo 
z70 %* z—0 
. l— cosx 1 ~ 4 
—— T am = 2 
(h) lim x? 2 ) lim e ° 
(j) lim (1 + x)? = e (k) lim ZZ! = 1 
z—0 z—0 x 
(1) lim sin (x + Ax) — sine _ cos x 
Az—0 Ax 
(m) lim cos (x + Ax) — cosx _ -sin x 
Az—0 Ax 
. nxt — (n+ 1j +1 _ n(n4+ 1) 
(m) im @-l? ` 27 


Answer to last part: To each e > 0 corresponds a ô > O such that 


nxt — (n +1) +1 _n(n+1) <e 
(x — 1)? 2 


whenever 0 < |x — 1[ < ô. 

3 The first formula of Theorem 3.285 assures us that if f(x) is near 3 and 
g(x) is near 5 when x is near a but x ¥ a, then f(x) + g(x) is near 8 whenever x 
is near a but x ~ a. Give similar applications of the other two formulas in the 
theorem. 

4 Tell whether you would like to learn and use new notation by which one 
or the other of the “formulas” 

(1) approx f(x) =L, approx f(x) = L 

€,0<|z—al<é z~a 
is used to abbreviate the epsilon-delta assertion: to each positive number € 
there corresponds a positive number ô such that f(x) approximates L so closely 
that | f(x) — L| < e whenever 0 < |x — a| < 6. If you have no opinion, think 
about the matter and get one. Remark: A person who thinks that this is a 
silly question may be thoroughly mistaken. It is not unreasonable to suppose 
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that scientists of the future will adopt notation like that in (1) and that their 
historians will wonder why on earth people ever concocted tales about moving 
numbers and converted a few basic theorems in the theory of approximation 
into a mystic “theory of limits”? that kept the world agog for several centuries. 
In our book, the theory of limits sometimes sounds like a theory of moving 
numbers but it is in fact a part of the theory of approximation. Let us get on 
with it. 

5 Verify the following assertions and replace the question marks by appro- 
priate answers. The basic limit theorems may be used. 


(a) lim 3x = 15 (b) lim 3x = ? 
25 2-2 
(c) lim (3 — 2x) = 3 (d) lim (4x — 5) = 
z—0 z—0 
(e) lim (y + 1)(y + 2) = 20 (f) lim (c +2) =? 
x+1_ 21 —-2_, 
(g) tim S224 = 3] (A) lim 35 z2 + rae £ 


6 Pay very close attention to the problem of evaluating 

. V2+h-vV2 

lim ————— 

h0 h 
because the process involves some troublesome points. Tell why the last part 
of Theorem 3.285 cannot be used here. Look at the problem and observe that 
it is difficult or impossible to guess what the answer (if any) is. Observe that 
we must put the quotient in a more manageable form before we can find its limit. 
The next step is to remember from experiences in algebra or to learn right now 
that the numerator and denominator of the quotient should be multiplied by the 
“conjugate” of the numerator. Thus 


lim YV2ERh- V2 1 V2 th- V2 V24h4+ V2 
im <= a L 
h->0 h lim h V22F h+ v2 
= lim 2+h-—2 lim __! 
~ ORD EA + VD = lim 272 
Tell which of the theorems of this section are used in making the last step. To 


be sure that this process is thoroughly understood, make the small notational 
adjustments necessary to obtain the formula 


Ax—0 Ax 2-Sx 


(x > 0). 


Put in at least as many steps as appear in the special case. 
7 Supposing that a > 0, show that 


, x 
lim —== = 2a. 
20 Vx +a?—a 


8 Show that 
lim VIt l o. 


z—0 Po 
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9 Supposing that y = x? and y + Ay = (x + Ax)?, show that 


] 22 = 
AsO Ax ox 
10 Prove that 
lim (+ Ax)? ~ x? 3x? 
åzr—=0 Ax 
11 Prove that, when x > 0, 
im VE tA — Vx _ 3 x 
Az—0 Ax 2 ` 
12 We have shown that 
sin (x +h) — sin x _ sin h osx l — cos A iny 
h r a i 
and we shall learn that 
i; sinh _ | i 1 — cosh 
hoo 0 h 0 
Use these facts to find that 
lim sn ef Ho sins = COS Xx. 


13 Supposing that y ¥ 0, prove that 


lim Z y? = —] 
lim + y? 


14 Supposing that y = 0, prove that 
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15 Prove that if lim f(x) = L, then to each positive number e there corre- 


sponds a positive number ô such that 


[f(2) — f(xi)| < € 


whenever 0 < |x2 — al < d and O < |x, — aj < ô. Remark: Proof of this result 
depends upon the idea that it two things are near the same place, then the things 


must be near each other. The details require careful attention, however. 


To 


prove the result, let e be a positive number. Then ¢€/2 is a positive number. 
Hence there is a positive number 6 such that |f(x) — L| < €/2 whenever 0 < 


|x — al < 6. Therefore, 
Fed — fle) = e — 2] e) = L 
S |fe) -L + e -L<$+5 6 


whenever 0 < |x, — al < ô and 0 < |x: — al < ô. 
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16 Recall that the signum function having values sgn x (read signum x) is 
defined by the formula 


senx = 1 (x > 0) 
sen x = 0 (x = 0) 
sgn x = —] (x < 0). 
Show that 
lim sgn x 
z—0 


does not exist. Solution: To prove this without the aid of the result of Problem 
15, we let f(x) = sgn x and prove that there is no number L for which the epsilon- 
delta assertion is true. Todo this we assume (intending to show that the assump- 
tion must be false) that there is a number L for which the assertion is true. Let 
e be a number for which 0 < e < 1, and let 6 be a corresponding positive number 
such that | f(x) — L| < e whenever 0 < |x| < 6. If 0 <x < 6, then f(x) =] 
and hence |1 — L| < e If -5 < x <0, then f(x) = —landhence|—1 ~ L| < 
e. Therefore, 


2=|l+1|=]1-L+1+L| |l- L+] +L] < 26 


and hence e > 1. This contradicts the inequality e < 1 and establishes our 
result. 
17 Show that if f(x) = |x|, then 
lim 109 + 2) — £0) 
h—0 h 
does not exist. Solution: Let g(h) denote the above quotient. When h} > 0, 
we find that g(h) = h/h = 1, and when 4 < 0, we find that g(h) = —h/h = —1. 
The result then follows from the preceding problem. 
18 Prove that the first of the assertions 


lim x? = 4, lim x? = 5 (?) 
1—2 1—2 


is true and that the second is false. 
19 If Dis the dizzy dancer function for which 


D(x) = 0 (x irrational) 
D(x) = 1 (x rational), 


prove that there is no a for which lim D(x) exists. 
>a 


20 Suppose that, in some vast universe, it really is true that each flea has a 
smaller flea to bite him. Suppose also that the universe contains at least one 
flea. Do these hypotheses imply that there exist fleas having mass less than 
I milligram? Ans.: No. The hypotheses would be satisfied if to each positive 
integer n there corresponds a flea whose mass in milligrams is 1 + 1/n, and the 


flea of mass 1 + 1/n is bitten by the flea of mass 1 + 24 z 1 
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3.3 Unilateral limits and asymptotes When we are talking about 
the function f for which f(x) = sgn (x — a) and see the graph shown in 
Figure 3.31, and in some other cases as well, 

we can cheerfully assert that Ag (lambda sub _——— 
R) is a number such that f(x) is near Ag when- 
ever x is near a and x > a. We can feel sure 1% x 
that we know the meaning of the assertion, but “AZ 

we must nevertheless know that the epsilon- Figure 3.31 
delta version of the assertion is the following. 

To each e > O there corresponds a 6 >0 such that |f(x) — Ar| < € 
whenevera < x < a+ ô This time the condition x ¥ a does not enter 
the assertion to bother our little sister and everything is very simple. 
The abbreviated version of the assertion is 


(3.32) lim f(x) = Xp. 

w—a-+ 
The new thing in this symbol is the plus sign that follows the a. Perhaps 
the best way to read this is “the right-hand limit as x approaches a of 
f(x) is Arg,” but it is always awkward to write one thing and say another, 
so the reading usually boils down to “the limit as x approaches a plus of 
f(x) is Ar.” In case there is no number for which the assertion is valid, 
we say that the right-hand limit does not exist. A similar succession of 
ideas leads to the symbol 
(3.321) lim f(x) = Ax, 

Toa — 
which says that the left-hand limit as x approaches a of f(x) is Az. 


If a function f and a number xo are such that the unilateral limits Az 
and Az in 


(3.33) lim f(x) = Xr, lim f(x) = Ax 

t—> Zo+ 2—-> Xo — 
exist and are different, then the function f is said to have a jump (or an 
ordinary discontinuity) at the point xo The magnitude of the jump is 
Ae — Az|. IfAr > Az, then f has an upward jump, and if Ar < Az, then 
f has a downward jump. 

Another assertion that turns out to be both interesting and important 
is the assertion that a function f and a number L may be such that f(x) 
is near L whenever x is large. When making this assertion precise, we do 
not use the letters e and ô but, instead, use e and some other letter, say 
N, that we can easily regard as a “large” number. The assertion means 
that to each e > 0 there corresponds a number N such that 


(3.34) f(x) — L| <e (x > N). 


By tossing in some surplus verbiage, we can put this in terms that may be 
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psychologically satisfying. Whenever a positive number e is selected, 
we can find a positive number N so large that f(x) approximates L so 
closely that |f(x) — L| < e whenever x is so large that x > N. The 
abbreviated version of this assertion is 

(3.341) lim f(x) = L. 

This is read ‘‘the limit as x approaches infinity of eff of ex is ell,” or “‘the 
limit as x becomes infinite of eff of ex is ell.” This does not mean that 
“infinity” is a place toward which numbers can gallop. All tales about 
infinityt and galloping numbers are completely irrelevant, and there is no 
sense in which x really “becomes infinite.” The assertion (3.341) means 
that f(x) is near L whenever x is large. We examine an example. Every- 
one who has an appreciation of the magnitudes of the numbers 1/2, 
1/416, 1/7,528,432, and 1/102 must believe that 1/x is near 0 whenever x 
is large, that is, 


(3.342) lim = = 0. 


t> 06 


To prove this, let e > 0. Let N = 1/e. Then the inequality 


-o< 
x 


is valid whenever 1 < ex and hence whenever x > 1/e and hence when- 
ever x > N. Thus when a positive number e is given, we are able to find 
a number N for which the e, N assertion is true. Therefore, (3.342) is a 
true assertion. It is equally easy to attach a meaning to the assertion 
that f(x) is near L whenever x is negative and has a large absolute value. 
The abbreviated version of this assertion is 


(3.343) lim f(x) = L. 


We say that the limit as x approaches minus infinity of f(x) is L. 
There are some important modifications of these ideas that should now 
be easily understood. In case f(x) = 1/(* — a) and also in some other 


cases, we can cheerfully assert that f(x) is large whenever x is near a and 
x > a. This assertion is abbreviated to 


(3.35) 


lim f(x) = œ. 
2—at+ 


It means that to each number M there corresponds a ô > 0 such that 


(3.351) f(x) > M (a<x<a+td), 


t For those who are really interested in infinity, a remark appears at the end ot the 
problems of this section. 
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that is, f(x) exceeds M whenever a < x < a+ 6. The assertion that 
f(x) is large whenever x is large is abbreviated to 


(3.352) lim f(x) = œ. 


T- 6 


It means that to each number M there corresponds a number N such that 
(3.353) f(x) > M (x > N). 


It is quite appropriate to recognize that ideas akin to those of this sec- 
tion sometimes appear in elementary geometry books when information 
about lengths of circles is being sought. Let C be a circle having radius 4 
and diameter 1. We can imagine that, for each integer n = 3, we have 
inscribed a regular polygon P, with n sides and have found its length Ln. 
We can assume (or perhaps prove) that there is a number, which we can 
call r, such that L, is near m whenever n is large. By this we mean that 
to each e > 0 there corresponds an integer N such that |L, — r| < € 
whenever n > N. The abbreviated form of the assertion is 


(3.354) lim La = r. 
n> © 

It is not necessary to try to explain how a polygon (which zs something but 
cannot do anything) can sprout more sides and approach the circle as the 
number of “its”? sides becomes infinite. The number r appearing in this 
way is the length of circle of diameter 1. We are all familiar with the fact 
that the length of a circle having radius r and diameter d is 2rr, or rd. 

The ideas of this section have swarms of applications. In particular. 
we can use them to introduce some ideas and terminology of analytic 
geometry. We begin by considering the graph of the equation y = f(x), 
where f is a given function. If 


(3.36) lim f(x) =L or lim f(x) =L, 


then the line having the equation y = L is called a horizontal asymptote 
of the graph. If 


(3.361) lim f(x) = œ or lim f(x) = — œ 
z—=a+ z=a+ 
or lim f(x) = © or lim flx) = —®, 


then the line having the equation x = ais called a vertical asymptote of 
the graph. Employing a modification of these ideas, we consider a case 
in which 4 and B are numbers such that 


(3.37) lim [f(x) — (dx + B)] = 0 or 
lim [f(x) — (4x + B)] = 0. 
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In this case, the line having the equation y = 4x + B is called an 
asymptote of the graph. This asymptote is horizontal if 4 = 0. We 
want to be able to apply similar jargon to graphs of equations, such as 

x? y 
(3.38) 3p 1, 
which are not graphs of functions. When we start with an equation of the 
form (3.38) and transpose all of the terms to the left side, we obtain an 
equation of the form 


(3.381) F(x,y) = 0. 


If f is a function such that (3.381) is true when y = f(x), then each 
asymptote of the graph of y = f(x) is also an asymptote of the graph of 
(3.381). Problem 7 at the end of this section involves the famous 
equation (3.38). 


Problems 3.39 


1 Using epsilons appropriately, give a full statement of the meaning of 
each of the following truthful assertions. In case an assertion is so subtle that 
we are not yet prepared to prove it and appreciate its consequences, we need not 
be disturbed. Scientists can, for example, understand the assertion “‘there is 
helium in the sun” before they are able to prove the fact and understand the role 
of helium in the production of energy radiated by the sun. 


(2) lim = 0 (6) lim t=0 
2— o rI—>—o X 

(c) lim Lee (d) lim l= -o 
20+ x z—0-— x 

(e) lim {x —V/x?-—1} =0 (f) lim Vx = 
z—> o z—0+ 

(g) lim tanx = © (h) lim logx = — œ 
z—r/2 ~ z—0+ 

(i) lim logx = © G) lim = sinx _ 9 

(k) lim 7H! = 1 () lim (1 +2) =e 
zo x — l xz— o x 

(m) lim (14 y =e (n) lim 128% = 0 
z= — o x zo X% 


(0) lim xlogx =0 
20+ 
(7) lim 2% = © 


a o 


(s) lim ez =0 


(u) e = lim [i+e+5454--- 


(p) lim x log (1 +1) =] 
xz—> o x 

(r) lim æ = œ 
x 0 


G) lim |x|" =0, (lel < 1) 


+=| 


n! 
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= |] _#? x‘ _ x8 e.. —_—1])" xn 
Œ) cos = lim [1 mtg at TY ani | 
e E E _ x xe x o. Iya xentil 
(w) sin x = lim E zi tzi 7i t + (-1) eal 
. e x? x? x2 x? 
6) sinas = lim [e0 -0-A a) 
lyr 
(y) x! = lim r Nn a ol, 2, 3, - - -) 
B noe (x + 1)(x+2)%4+3)--- (x + n) 
(s) lim x! = œ 
z—+—1-+ 
2 Does the statement 
approx 4 = 
6r>N N 


abbreviate the statement that to each positive number e there corresponds an 
integer N such that |1/n| < e whenever n > N? Ans.: It can, but it does only 
if we agree that it does. Remark: Whether the above abbreviation is better than 
the abbreviation 

lim 1 =0 

no N 
is purely a matter of opinion. If a person has the habit of using one notation, 
the other must seem to be quite absurd, awkward, and unteachable. 

3 Draw a graph of the equation y = x?. Then, supposing that M is a given 

number, show how the figure can be used to support the assertion that 


lim x? = œ. 
z= 0 


4 One of the assertions 


lim t = æ (?), lim4 = œ (2 
20 X% 2-0 * 
is true and the other is false. Give a full discussion of this matter. Remark: 
Here and elsewhere, displayed assertions followed by question marks may be 
false assertions. 
5 With the aid of the idea that the numerator and denominator of the first 
quotient can be divided by x, show that 


. x£—-1 . 2x? —])1 . 3x3 — 1 
= = 3 
(2) lim i l (4) lim x? + | 2 (c) lim x$ +1 
.  xtbaet3 1 . xt +2 +3 
d x+ 2x +321 lim t2 +320 
(2) lim saati? (¢) lim aats 
(f) lim —X— = 1 (g) lim —— = œ 
z= Vi + x? Ye VJI tx 


6 Show that both coordinate axes are asymptotes of the graph of the equa- 
tion y = 1/x. 
7 There will come a day when we must learn that the graph of the equation 
x? y? 


ap! 
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in which a and b are positive constants, is a hyperbola. Show that if the point 
(x, y(x)) lies on the hyperbola and y(x) > 0, then 


y(x) = 2 Vx? — a. 
Show that 
. b _ 
an [Beso] -0 


and hence that the line having the equation y = (b/a)x is an asymptote of the 
hyperbola. Hint: The formula 


x Vx a“ = 1 A 
x+ vx —a 


turns out to be a useful source of information. 
§ Find the equations of the asymptotes of the graphs of the equations 


(a) y = 2E Ans.ix=l,y=1 
x+ 1X? a _ 
@) s =(255 Ans.x =2,y =1 
() yaa t= Ans: x =O, y=x 
1\? 
(d) y= (=+) Ans:x=0 
(ec) xy=x+y Ans:x =], y =1 
A x+y [=x+y Ans.: None 


()y=Vxt+1— vz Ans: y =0 
9 According to part f of Problem 1, the first of the statements 
lim VWx=0(?), lim Vz =0(?) 
z0+ z—>0 
is true. Is the second statement also true? 
J10 Prove that 
. 1 2 n\ _1 
lim (atat o +3)=5 


Remark: This remark is dedicated to unfortunate individuals who never knew 
or have forgotten that if 


S=1+ 2 + 3 + 4 4-++4(n—-D +n, 


then 
Sa =n + (n—1)+ (n2) +(n—3)+--- + 2 +1 


and addition gives 2S, = n(n + 1), so Sy = n(n + 1)/2. 
If Starting with the definition 


(1) ni=1-2-3--- 2, 
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which is applicable when x is a positive integer, show that 1! = 1, 2! = 2, 
3! = 6,4! = 24, 5! = 120, 6! = 720, and 7! = 5040. Then give a full statement 


of the reason or reasons why it is true that, when z is a positive integer, 


(2) z! = lim 1:2°3-->- 2 
1-2-3 --- 2(2+1)+2)--- (s+n) 


l= li 
OFS De G@FNGF2)°°- FD) 
lz 
l= |; nin nt+Iin+2 .n+2 
9 es eG ED +2)  GEn n n n 
6) l=] nn 


mG FI@+2) GFA 


Remark: To show that the above manipulations serve a useful purpose, we take 
a little mental excursion. A complex number z is a number of the form x + îy, 
where x and y are real numbers and 7 is the imaginary unit for which 72? = —1. 
While this book neither develops nor uses the algebra and calculus of complex 
numbers, we remark that x + iy is the real number x if y = 0 and that x + iy 
is a real integer if y = 0 and x is a real integer. We are now ready to look at 
(5). We have seen that (5) is correct if z is a positive integer and the definition 
(1) is applicable. While proof of the fact lies far beyond our present capabilities, 
it can be proved that the limit in the right member of (5) exists and is a complex 
number whenever z is a complex number which is not a negative integer. More- 
over, when z is a complex number which is not a negative integer, z! is defined 
to be this limit. It follows from the definition that z! is a real number whenever 
zis a real number which is not a negative integer. Carl Friedrich Gauss (1777- 
1855), who had the habit of knowing how things should be done, made very 
effective use of (5). The index can always show us where this and other informa- 
tion about factorials is concealed. 

12 If the preceding problem and remark have been digested, prove that 
O!=1. Remark: Proof of the more esoteric facts that (—})! = vr and ($)! = 
4/x/2 will not be too difficult when more mathematics of the right kind has been 
learned. 

13 Observe that 8! = 8(7!). Then, assuming that the limits exist, prove that 


nint! 
m G@F1+b@ti +2) "(1 +1+n) 


nin? 
= (z+ 1) lim GF DGFI GED FIGF G+ my 


Finally, use the remark of Problem 11 to prove that 
(z + 1)! = (z + 1)(!) 
when z is not a negative integer. 


14 For what pairs of numbers n and k does it make sense to define the binomial 
coeficient function by the formula 


Gi) ACE — IE 


Hint: If necessary, read Problem 11. Ans.: When n, k, and n — k are numbers 
(real or complex) that are not negative integers. 
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15 Try to make friends of the contents of the preceding problems by proving 
that 


o (i) (2) CF) 


when n, k — 1, and n — k are not negative integers. Remark: As some people 
learn while studying algebra, the ordinary binomial coefficients (in which n and 
k are integers for which O < k < n) are the coefficients appearing in the formulas 


(2) (a+b =1 

(3) (a+biz=a+b 

(4) (a + b)? = a? + 2ab + b? 

(5) (a + b)? = a3 + 3a?b + 3ab? + b? 


and, in general, in the binomial formula 


(6) (a+b) = (;) arp + c) arly + C) a-b p -p () apn, 


With the aid of (1), it is easy to fill in the rows of the Pascal triangle 


1 5 10 10 5 1 


which displays binomial coefficients. The sum of two consecutive elements of 
one row gives the element that lies below the space between them, and more 
rows of the Pascal triangle are easily written. 

16 We can feel sure that if x > 1, then x” must be large whenever 7 is large, 
but it is nevertheless worthwhile to be able to prove the precise version of the 
statement. When x > 1, there is a positive number h such that 


(1) x = Í + h; 

in fact, k = x — 1. Observe that 

(2) x= 1 +42h+hk?>1+2h 

(3) x3 = 1 43k 43k? + hk? >11 43h 


and that the binomial formula shows that 
(4) x*>1+ nh 


when n 2 2. It follows that if M is a given number and we choose a number N 
such that N = 2 and N > M/h, then we will have 


(5) x*>1lt+nhkh>nhk>M 


whenever n > N. Therefore, 


(6) lim x” = œ (x > 1). 


n> æ 
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17 We can feel sure that if |x| < 1, then x” is near 0 whenever n is large. 
How can we prove it? Solution: Let e > 0. Suppose first that x = 0. Then 
\x"| < e when n > 1. Suppose finally that 0 < |x] < 1. Let y = 1/|x| so that 
y >1. Then the preceding problem shows that 


lim y” = œ. 
N> © 


If we choose an index N such that y” > 1/e when n > N, then 


[x7] = Ice 
y” 
when n > N. Therefore, 
lim x" = 0 (x| < 1). 
N> O 
18 Prove that 
. 1,1 1 1 
üm [it t... 11]= 
lim |z +++ +5 l 
lim Dtr o $e] =o (|x| < 1). 
Nn © 1 — xX 
Hint: Long division (or factoring) shows that 
1— xot 


O m n a aie n +e = 


and we may use the fact that lim x" = 0 when |x| < 1. 


N- © 
19 Once again, let the “bracket symbol” [x] denote the “greatest integer 
in x,” that is, the greatest integer less than or equal to x, so that [8] = 8 and 
[15.359] = 15. Show that, for each integer n, 


lim [x] = 2, lim [x] =n — 1. 
z—n+ rn — 
20 Prove that if gis a function and 4 and B are numbers such that |g(x)| S 4 
whenever x 2 B, then 


21 Prove that 


Hint: Let 6(x), read “theta of x,” denote the “fractional part”? of x so that 
O(x) = x — [x] and [x] = x — 6(x). 

22 Sketch a graph of the function A for which h(x) = [x]/x when x 2 1, 
and observe that A(x) really is near 1 whenever x is large. 
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23 Sometime we will learn that 


(1) lim ~ =0. 


n— 0 27 
Hence there must be an integer N such that 


n3 1 


(2) a < 100 


when n > N. Some numerical calculations can make us quite sure that (2) is 
valid when n > 20. Even though the author considers the problem to be too 
dificult for assignment at this time, it may be worthwhile to seek a way to deter- 
mine whether (2) is valid when n > 20. 

24 Prove that if x is a rational number, say ~/g, where p and q are integers, 
then sin nlx = 0 for each sufficiently great integer n. Prove that if x is an 
irrational number, then sin n!xx = 0 for each integer n. Using these results, 
show that 

1 — lim sgn sin? alrx = D(x) 
n—> © 
where D is the dizzy dancer function for which D(x) = 1 when x is rational and 
D(x) = 0 when x is irrational. 
25 Some old analytic geometry books pretend to prove that if n is a positive 


integer and Po, Pi, © > > , Pa are polynomials in x, then the line having the 
equation x = x, will be an asymptote of the graph of the equation 
(1) Po(x)y" + Pilx)y™? + + + + + Pa-ilx)y + Pax) = 0 


provided Po(x1;) = 0. These old books present unclear and unreliable treatments 
of matters involving limits and asymptotes, however, and the stated result is 
false. Prove that the line having the equation x = 0 is not an asymptote of the 
graph of the equation 


(2) xy? + xy +1 = 0. 


Remark: An example which establishes falsity of an assertion is called a counter- 
example. Persons who speak German (and many others also) call it a Gegen- 
beispiel. The simpler equation x*y? + 1 = 0 serves the present purpose; the 
graph of this equation is the empty set. 

_ 26 Prove that if fi, fe, fs are continuous at a, if 

(1) lim y(x) = », 


z—a+ 


and if, for some positive number ô, 


(2) Am)y@)P + frle)y@) + f(x) =0 (a<x<a+t ð), 


then fı(a) = 0. Hint: Choose a positive number 6, such that ô; < ô and y(x) > 
1 when a<x<a+56,. Then, supposing that a < x < a + ô, divide the 
members of (2) by [y(x)]? to obtain 


fale) , fale) _ 
he) +3) + yap =o 
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27 For hundreds of years, people have been interested in the magnitude of 
a(x), the number of primes less than or equal to x, when x is large. About the 
year 1900, mathematicians succeeded in proving a remarkable fact that had been 
surmised since the time of Euler (1707-1783). It was proved that 


(*) lim 2%) = 1, 


t— 0 x 


log x 


We may know very little about logarithms and may not yet have learned that, 
in mathematics above the level of elementary trigonometry, logio x denotes the 
logarithm of x with base 10 and log x denotes the logarithin of x with base z. 
We may not yet know how to calculate log x when x is a given positive number. 
Nevertheless we should be able to tell the meaning of the star formula. Do it. 
Remark: Anyone who wishes to make a very modest calculation may use the fact 
that log 20 is approximately 3 and may determine q (20). When working on chalk 
boards and scratch pads, many people make effective use of stars and daggers and 
other things (instead of numbers) to designate significant formulas. The valuable 
idea is illustrated only occasionally in this book. 

28 [tis sometimes said that mathematics is a language. Perhaps it would be 
more sensible to say that mathematics is a collection of ideas and that mathe- 
matics books use language in more or less successful attempts to reveal the ideas. 
In any case, language is important and definitions constitute a basic part of this 
language. To help us realize this fact, we consider an example involving regular 
polygons. A regular polygon is a set in Ee consisting of the points on the line 
segments PoP1, PPa, © © © ,Pn—1Px, PxnPo, where the points Po, Pi, © © © ,Pn,Poare 
equally spaced on a circle, n being an integer for which n 2 3. Under this 
definition, a circle is not a regular polygon. We do not have pencils sharp enough 
to draw regular polygons having a million sides, but we can nevertheless tolerate 
the idea that if we could draw one on an ordinary sheet of paper, then the result 
would Zook like a circle. We cannot, however, tolerate the ancient collection of 
words ‘‘a circle zs a regular polygon having an infinite number of infinitesimally 
small sides” as a part of our doctrine of limits. To take a sensible view of this 
matter, we can know that there was a time when the best of our scientific ancestors 
used fuzzy language and whale-oil lamps but we can also know that they worked 
mightily to produce better products. 

29 As was stated in Section 1.1, a number x appearing in this book is a real 
number unless an explicit statement to the contrary is made. This circum- 
stance does not prohibit recognition of the fact that numbers other than real 
numbers can appear in mathematics. It is possible, and is sometimes worth- 
while, to define and employ a set S* of numbers which contains each real number 
x in the set S of real numbers and, in addition, two numbers — © and œ. When 
the set S* is employed, each real number x is said to be finite and the numbers 
— æ and œ are said to be infinite (not finite). Order relations are introduced 
in such a way that ~œ < œ and —« <x < œ whenever x is a real number. 
While these order relations are simple and attractive, it turns out to be impossible 
to formulate a useful collection and algebraic laws (or postulates) in such a way 
that œ — œ and 0-0 are numbers in S*. Persons starting with enthusiasm 
for — 0 and œ usually lose most of their fascination when they learn that the 
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relations O+% = 1, § = œ, and § = — œ are as absurd in the “‘algebra” of S+ 
as the symbol å is in the algebra of S. We can be momentarily delighted by the 
“algebraic law” which says that œ + x = œ whenever x is a real number, but 
general usefulness of the unorthodox ‘‘algebra” is greatly impaired by the fact 
that the relation y + x = y does not imply that x = 0 because y might be œ 
and x might be 416. For present purposes, we do not need substantial informa- 
tion about these matters, but a little basic information can be very helpful. 
There are circumstances in which — œ and œ are considered to be numbers, but 
there are no circumstances in which — œ and œ are real numbers to which we 
can apply the algebraic rules (or laws or axioms or postulates) that apply to 
real numbers. Whether or not we consider — and © to be numbers, it is 
worthwhile to recognize that some of the most convenient terminologies and 
notations of modern mathematics are relics of times when the “‘doctrine of limits” 
was based upon visions of a number x galloping toward infinity and becoming 
so infinitely great (but still not œ) that its reciprocal becomes infinitesimally 
small (but still not 0). These infinitesimals of mathematics, like the aether and 
phlogiston of physics and chemistry, can now be regarded as mystic absurdities, 
but they were hardy concepts having tremendous impacts upon present as well 
as past science and philosophy. We can conclude these remarks with another 
bit of history. In the good old days when mathematical terminology was incredi. 
bly erratic, sane physicists got the habit of saying that a number is “finite” when 
they wished to emphasize their idea that it is neither zero nor infinite nor infi- 
nitely small nor infinitely large. It will be interesting to see how long physicists 
continue to make modern mathematicians shudder by using the word ‘“‘finite” 
to mean “good honest nonzero noninfinite number, with no nonsense.” The 
physicists have good intentions, but mathematicians consider zero to be a finite 
number, with no nonsense. - 


3.4 Continuity This section contains information about functions 
and limits that we will need. Our first task is to obtain a full under- 
standing of the following definition. 


Definition 3.41 A function f is continuous at xo (or at the point with 
coordinate xo, or at the point xo) if 


lim f) = f(x). 


The assertion that f is continuous at x» is nothing more nor less than the 


assertion that f(x) is near f(xo) whenever x is near xo. It means that to 
each e > 0 there corresponds a ô > 0 such that 


(3.42) lf) — flxo)| < e (lx — xo] < ô). 


The definition implies that f cannot be continuous at xo unless f(xo) 
exists, that is, unless xa belongs to the domain of f. In case f(%o) exists, 
the first inequality in (3.42) automatically holds when x = xo and we do 
not need to bother with the restriction x = x that appears in the defini- 
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tion of limit. With a small change in notation, we can see that f is 
continuous at x if and only if 


(3.421) lim fle + Ax) = fle) 
or 
(3.422) lim [fl + x) — f@)] = 0. 


Figure 3.43 shows, for the case in which f(x) = x? and Ax > 0, the 
geometric interpretations that can be given to the numbers appearing in 
these formulas. 


cr —— Ee Ome ae cee ce eee or 


F 
Ay=f(x+ Ax) -f(x) 


a So 
j f(x+ Ax) 
f(x) 


x 
Figure 3.43 


Definition 3.44 A function f is said to have right-hand continuity at a 
if the first of the assertions 


(3.441) jm, fœ) = fla), lim f(x) = f(b) 


is valid and to have left-hand continuity at b if the second is valid. 
Supposing that a < b, we can let fı be the function having the graph in 
Figure 3.442 so that fi(x) = 0 when x < a, fi(x) = 1 whena Sx S b, 
and fi(x) = 0 when x >b. This function is continuous at each x for 
which x = a and x Æ b. The function has right-hand continuity at a 


i 
1 l 1 
a b x G ð 
Figure 3.442 Figure 3.443 


and has left-hand continuity at b. It does not have left-hand con- 
tinuity at a, and it does not have right-hand continuity at b. Let fe 
be the function having the graph in Figure 3.443 so that fa(x) = 0 when 
* Sa, fox) = 1 when a <x <b, and fox) =0 when x2 b. This 
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function, like fi, is continuous except when x = a and x = b. However, 
fo has left-hand continuity at a and right-hand continuityf at b. 

Definition 3.45 A function f is continuous over an intervala Sx <b 
if it is continuous at each xo for which a < xo < b and, in addition, has 
right-hand continuity at a and left-hand continuity at b. 

The definitions of this section are designed to be useful in discussions of 
examples of functions, and we begin by looking at examples of functions. 
Let g be the function, defined for x # 0, for which 


(3.451) g(x) = : (x = 0). 


This function is continuous at each xo Æ 0 because, when xo ¥ 0, our 
theorems on limits imply that 


, oe 1 _1_ 
(3.452) lim g(x) = limy = imk n g(xo). 


However, g cannot be continuous at 0, because g(0) is undefined and there 
is no possibility of having lim g(x) = g(0). We say that g is discon- 
z—0 


tinuous at 0. Now let A be the function defined over —œ <x < œ 
(this means merely that the domain of h is the entire set of numbers) by 


h(0) = Oand 
(3.453) h(x) = 


R | re 


(x = 0). 


This function, like g, is continuous at each x) Æ 0, but this time h(0) 
exists and there is no possibility of having lim h(x) = A(0), because 
x—0 


lim h(x) does not exist. Let w (omega) be the peculiar function for 


20 

which w(0) = 1 and w(x) = 0 when x #0. For this function both 

w(0) and lim w(x) exist, but the function is discontinuous at 0 because 
2-0 


(3.454) lim w(x) = 0 #1 = (0). 


z—0 


t It is to be expected that some readers, particularly those more interested in applied 
mathematics than in pure mathematics, may feel that matters now being considered are 
much too theoretical to have practical interest. Some people know, and others can learn, 
that when a battery has its terminals connected to appropriate electrical hardware, it almost 
instantly produces an electromotive force (the kind of a force that pushes or pulls electrons 
around) which we may, for present purposes, suppose to have the constant value 1. When 
the battery is not connected, the electromotive force produced by itis 0. Thus, batteries 
which are connected over some time intervals, and disconnected over other time intervals, 
produce electromotive forces that are, as functions of time, very closely approximated 
by step functions such as those we have been considering. The discontinuous functions 
are introduced to simplify problems, not to complicate them. This is one of the reasons 


why persons interested in applications of science must recognize existence of discontinuous 
functions. 
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The graphs of the signum and Heaviside functions shown in Figures 3.191 
and 3.192 should indicate that these functions are continuous every- 
where except at x = 0. One who has seen numerous examples of func- 
tions and their graphs should realize that he can enter the construction 
business to produce more examples. He can start with a clean coordinate 
system and, as in Figures 3.46 and 3.47, mark points +x, tx2, tx3, °° ° 


meee eee ee ee ee ee One owe eee ee 


Figure 3.46 


on the x axis and then sketch a part of a graph which oscillates through 
these points in any way he likes. Provided only that the graph contains 
no two different points having the same x coordinate, the graph will be 
the graph of a function. In Figure 3.46 the graph is drawn tangent over 
and over again to the lines having equations y = 1 and y = —1. In 
Figure 3.47 the graph is drawn tangent over and over again to the 


Figure 3.47 


parabolas having the equations y = x? and y = —x*. It can be shown 
that the graphs of the functions defined by gi(x) = sin(1/x) when x # 0 
and g(x) = x? sin(1/x) when x = 0 and g2(0) = 0 look very much like the 
graphs in Figures 3.46 and 3.47, but we need not worry about this matter 
now. It should be clear from Figure 3.46 that gı cannot be continuous 
at x = 0 because lim gi(x) does not exist. For the function g» the story 


is different. Since 


(3.471) —x? S glx) S x?, 


148 Functions, limits, derivatives 


it follows from the sandwich (or flyswatter) theorem that 


(3.472) lim g(x) = 0 = 92(0), 
z—0 
SO gg must be continuous} at x = 0. 

It is easy to prove fundamental facts about functions formed by com- 
bining continuous functions in various ways. With the aid of Theorem 
3.285 on limits, we see that if A(x) = f(x) + g(x) over an interval con- 
taining xo, and if f and g are continuous at xo then 

lim A(x) = lim f(x) + lim g(x) = f(x) + g(%o) = A(%o). 

T> ZO T Xo T> Zo 
This shows that the sum of two continuous functions is continuous where- 
ever the terms being added are both continuous. Very similar argu- 
ments show that the product of two continuous functions is continuous 
wherever the factors are continuous and that the quotient of twocontinuous 
functions is continuous whenever the numerator and denominator are 

continuous and the denominator is not zero. 
4 We should now see that the function f, which 
is defined over the interval —1 S x < 1 and 
which has the graph shown in Figure 3.48, is 
continuous over the interval 0 < x S 1; it is 
Zi 0 i x continuous at each xo for which 0 < xo < 1,it 
Figure 3.48 has right-hand continuity at 0, and it has left- 

hand continuity at 1. As a bonus for knowing 
about limits, unilateral limits, and continuity, we find that we can easily 
understand and remember some fundamental facts that are frequently 
used in applied as well as in pure mathematics. A function f has a 
limit as x approaches a if and only if the two unilateral (right and left) 
limits exist and are equal. The function is continuous at a if and only 
if the two unilateral limits exist and are equal to f(a). 


Problems 3.49 


I The statement that 
5x3? + 2x? — 4x + 16 


is continuous is an abbreviation of the statement that the polynomial function 
P having values P(x) defined by the formula 


P(x) = 5x3 + 2x? — 4x + 16 


t It has sometimes been thought to be meaningful, and perhaps even true or helpful or 
both, to say that a function f is continuous if and only if “it is possible to draw the graph of 
f without lifting the pencil from the paper.” Enthusiasm for this statement must be chilled 
when we realize that a continuous function may have an infinite set of oscillations in a 


finite interval and that feeble mortals never succeed in drawing more than a finite set of 
them, 
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işs continuous. Prove the statement by filling in the intermediate steps in the 
formula 


lim P(a) = + + © = Pa) 


and tell which theorems on limits are used in the process. Remark: The same 
procedure shows that each polynomial is continuous. 
2 Letting 
_ (& — 1) — 3) 
Q) = G—2)@ — 4) 


show that Q is continuous at each x except 2 and 4. 

3 Prove that the quotient of two functions is continuous wherever both 
functions are continuous and the denominator is not zero. Remark: We recall 
that the quotient of two polynomials is sometimes called a rational function. 
Our results show that a rational function is continuous wherever the denominator 
is not zero. 

4 Determine the points of discontinuity of the functions fı, etcetera, for 
which 


(a) file) =F E) fils) = 7S 
1 
(c) fa(x) = x1 — x) (d) fax) = |x| 
1 
O fle) = sa O Ae) = 


5 Does the assertion 
approx f(x) = fla) 
é,|a—al<s 
abbreviate the assertion that to each positive number e there corresponds a posi- 
tive number ô such that [f(x) — f(a)| < e whenever |x — a| < ô? Ans.: It 
can, but it does only if we agree that it does. 

6 Taxi fare is 50 cents plus 10 cents for each quarter mile or fraction 
thereof. Letting f(x) denote the fare for a ride of x miles, sketch a graph of f 
and tell where f is discontinuous. 

7 Assume (as is not quite true) that it takes 0.5 calorie of m- p 
heat to raise the temperature of 1 gram of ice 1 degree centigrade, 100 
that it takes 80 calories to melt the ice at 0°C, and that it takes 80 — 


1.0 calorie to raise the temperature of 1 gram of water one degree 6— 
centigrade. Supposing that —40 S x < 20, let Q(x) be the 40 — 
number of calories of heat required to raise one gram of H,0 > 
from temperature —40°C to x°C. Sketch a graph of Q. Ans: Zb è 


Figure 3.491. 

8 The magnitude of the gravitational force which the earth 
exerts upon a particle is called its weight W. Suppose (as would 
be true in the mechanics of Newton if the earth were a homogeneous spherical 
ball) that there exist constants kı and kə such that 


W = kx (OSs x<R) 
= ~“s (x 2 R), 


Figure 3.491 
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where x is the distance from the center of the earth to the particle and R is the 
radius of the earth. Supposing that W is a continuous function of x and that 
W = 100 when x = R, calculate kı and ke and sketch a graph of W versus x. 

9 Prove that if f is continuous at xo, then so also is the function g having 
values defined by g(x) = |f(x)|. 

10 It is never too soon to start becoming acquainted with the idea that if, 
during some time interval t; < ż < te, a bumblebee or molecule or rocket buzzes 
around, then at each time ż in the interval it is surely someplace and that if we 
let f:(t), fo(t), fa(¢) denote its x, y, z coordinates at time ż, then fi, fe, f3 are con- 
tinuous functions of ¢. Since wholesome comprehension of mathematics is salu- 
brious, we recognize that we do not quite know how to prove that bumblebees 
never fly out of our £3 for a minute or two. Moreover, we do not know how to 
devise a mathematical proof that a bumblebee cannot gather honey all morning 
in Pennsylvania, be in Chicago at noon, and hunt clover in Los Angeles all after- 
noon. The best we can do is make the physical assumption that fı, fo, fs are 
continuous and know what the assumption means. What does the assumption 
mean? Ans.: If t; < t< te, then 


lim f(t + At) = f(t) 
At—0 


when k = 1, when è = 2, and when & = 3. 

11 Abandoning some of the notation of the preceding problem, we suppose 
that x, y, z are given functions that are continuous 
over some interval in which ż is supposed to lie. 
Let P(t) denote the point in £3 having coordinates 
x, y, z for which x = x(t), y = y(t), and z = 2(t). 
While the fact will be considered later with more 
details, we can pause to learn that the ordered set 
of points P(t), ordered so that P(t’) precedes P(t”) 
Figure 3.492 when 2?’ < £t”, is called a curve C. The point P(t) 

is then said to move along or traverse the curve C asi 
increases. Figure 3.492 may be helpful. For each z, let r(#) be the vector run- 
ning from the origin O to P(t). This determines a vector function r for which 


(1) r(t) = x(t)i + y(é)j + z()k. 


Conversely, if r is a given vector function, then it (and the given coordinate 
system) determines its scalar components. From (1) and 


(2) r(t + Az) = x(t + Asi + y(t + Adj + z(t + ADK 


we obtain 


(3) rE + At) — r) = [xt + Ad) — lli t be + Ad — yD] 
+ [z( + At) — 2(2)]k 


and 


(4) |r + At) — r(¢)| = [lx + As) — Ol + [ye + Ad) — yal 
+ |z(¢ + At) — 2(2)|?]4. 
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As is easy to guess, the vector function r is said to be continuous at ¢ if 
(5) lim |r( + Az) — r(t)| = 0, 

At—0 
and we write 


(6) lim r(t) = w 


t—to 


if w is a vector for which 


(7) lim |[r(z) — w| = 0. 
tto 


It is a consequence of (4) that a vector function is continuous if and only if its 
scalar components are continuous. 
12 Using ideas from the preceding problem, let 


r(t) = x(t)i + yOj+ z()k 
w = ai+ bj + ck 


and prove that lim r(¢) = w if and only if 
t— to 


lim x(t) = a, lim y(t) = b, lim 2(¢) = c. 
t—>to t— >to t— lo 
Hint: Write and use a formula for [r(¢) — wl. 

13 Once again, let the symbol [q] denote the greatest integer which is less 
than or equal to g. Let f be the function for which 


so- [E] 


when x > 0. Draw the graph of f and tell where f is discontinuous. 
14 Using the “bracket notation” of the preceding problem, determine whether 


im = [5] 
lim x | - 
z—0+ x 
exists. 
I5 Letting D be our old friend, the dizzy dancer function, for which 


D(x) = 0 (x irrational) 
D(x) = 1 (x rational), 


show that there is no a for which 


lim D(x) = D(a) 


and hence that this function is everywhere discontinuous. 

16 A potential new friend g is defined over the closed interval 0 Sx <S lin 
an interesting way. If x is irrational, then g(x) = 0. If x is 0, then g(x) = 1, 
and if x = 1, then g(x) = 1. Ifxis a rational number for which O < x < 1 and 
if x = m/n, where m and n are positive integers having no common positive 
integer factor exceeding 1, then g(x) = 1/n. Thus g(4) = $, g(3) = $, e() =$, 
g(t) = t, (È) = 4, e(b = t, ($) =F, etcetera. Sketch a figure indicating the 
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nature of the graph of g. Show that g is discontinuous at each x for which x is 
rational and that g is continuous at each x for which x is irrational. Hint: If eis 
a given positive number, then the set of numbers x for which g(x) > € contains 
only a finite number of elements. ‘This fact is useful. Remark: While interest 
in the matter should be postponed, this is an example of a bounded function 
having a countably infinite set of discontinuities. Moreover, each subinterva] 
of the interval 0 S x < 1 contains an infinite set of these discontinuities, but the 
set of discontinuities has Lebesgue measure zero. The function g is the famous 
corn-popper function. 

17 Some people know very much about the function F for which F(r) is the 
number of lattice points (points having integer coordinates) lying inside and on 
the circle of radius r having its center at the origin. Give at least a little precise 
information about F. 

18 Give an example of a function f such that 0 < f(x) S 1 whenO0 Sx <1 
and such that f is continuous at each point of the interval 0 S x S 1 except at 
DEEE 

19 Give an example of a function which (i) is defined over the closed interval 
0 = x S 1, (ii) is continuous over the open interval 0 < x < 1, and (iii) is not 
continuous over the closed interval 0 Sx Sl. 

20 Show that if x1, x2, x3 and 4, B, C, D, E are constants for which x1 < xa < 
xz and C #0, D = 0, E #0, and if 


f(x) = Ax + B 4 Cļ|x — x| + Dix — xo] + Elx — xl, 


then f is continuous and the graph of f is a broken line consisting of line segments 
joined at vertices whose x coordinates are x1, x2, %3- 


21 Let 
(1) J(=) = —x (x < 0) 
f(x) =x (O0 Sx <!) 
fx)=2-x (lsx<2) 
f(x) =0 (x 2 2), 


so that the graph of fis a broken line having corners at the points (0,0), (1,1), and 
(2,0). Determine five constants 4, B, C, D, E such that 


(2) f(x) = Ax + B+ Clx| + Dix — 1| + Elx — 2l. 


Hint: For each of the four intervals x $0,OSxS51,1S%8 2, andx 22, 


replace the left member of (2) by the appropriate expression and replace the right 
member of (2) by the appropriate expression not involving absolute-value signs. 


Ans.: 
f(x) = —ax + |x] — [x — 1] + dx — 2l. 


3.5 Difference quotients and derivatives Let f be defined over an 
interval a < x < b and let x be a number for which a < x < b. Let 


Ax be a number, which may be positive or negative but not 0, for which 
asx+Ax Sb. We may then set 


(3.51) y = fx), yt Ay = f(x + Ax), 
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subtract to obtain 
Ay = f(x + Ax) — f(x), 
and then divide by Ax to obtain 


(3.52) a = fa t Ae) ~ fe) 


This quotient, which is clearly a quotient of differences that are calculated 
in a special way, is called a difference quotient. Difference quotients have 
already appeared in our problems, and we shall see later that they have 
important interpretations. Leaving the hosts of applications to be par- 
tially revealed later in this textbook, and to be continually revealed to 
those who pursue further studies in the sciences (including mathematics), 
we now come to one of the two most important ideas in the calculus. If 
the difference quotient in (3.52) has a limit as Ax approaches zero, then 
f is said to be dzfferentiable at x and the limit is called the derivative of f 
at x. In case the limit fails to exist, the function is said to be nondiffer- 
entiable at x and we say that the derivative of f at x does not exist. 
There are two very different and very useful notations for derivatives. 
The first, appearing in the formula 


(3.53) f'(x) = lim = = lim fx + Ax) = f(x), 


Ax—0 Ax 


is usually read “‘eff prime of ex,” but it can be read ‘‘eff prime at x” or 
“the derivative of f at x.” This “prime notation” is called the Newton 
(1642-1727) notation.t The second notation, appearing in the formula 


(3:54) D _ lim AY = lim ETA) — f@) 


dx  azso Ax = a3 Ax 


is read “dee y dee x” or “the derivative of y with respect to x’ and was 
originated by Leibniz{ (1646-1716). There will be times in the future 
when we will consider dy/dx to be the quotient of the two numbers dy 
and dx. Meanwhile, the whole symbol dy/dx is to be regarded as a 
single symbol, just as the symbol H represents a single letter of the alpha- 
bet and not 11 divided by 11. A longer and perhaps dismal discussion of 
this terminology and notation appears in a remark at the end of the prob- 
lems of this section; congratulations can be bestowed upon readers wise 
enough to know that the discussion is semisuperfluous. 

According to an old and honorable tradition, the definition of dy/dx and 


t The original Newton notation was the “dot notation” or the “flyspeck notation” 
which employed f instead of f’, but replacing the dot by the prime is a clerical modification 
that preserves the original idea of Newton. 

ł Leibniz, like Newton, published his scientific works in Latin. The Latin spelling 
“Leibnitz” is sometimes seen and sometimes helps people to pronounce the name correctly. 
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the manner in which it is applied can be (or should be) remembered with 
the aid of the famous “‘four-step rule.” We may not always get 4 when 
we count the steps, but the rule is the four-step rule anyway. 


Four-step rule 3.55 


Definition Application 
y = f(x) y = x? 
y + Ay = f(x + Ax) y + Ay = (x + Ax)? = x? + 2x Ax + Ax? 

Ay = f(x + Ax) — f(x) Ay = 2x Ax + Ax? 
Ay _ fle +Ax)— fx) wL 
Ax Ax Ax 2x + Ax 
dy _ |. Ay dy _ 
de ~ I Ax dx T ** 


The steps are as follows: select (or “‘fix”) an x in the domain of f, write 
y = f(x), introduce Ax, write y + Ay = f(x + Ax), subtract to get Ay, 
divide by Ax to get Ay/Ax, and, finally, find the limit as Ax — 0 to obtain 
dy/dx. Whether we consciously use the four-step rule or not, we all 
need experience in the art of calculating derivatives by finding limits of 
difference quotients, and problems at the end of this section provide 
some of it. Meanwhile, we gain experience by proving the following 
formulas which can be and must be remembered. 

Theorem 3.56 If u and v are differentiable functions of x and if c 
and n are constants, then 


(3.561) fut) = $42 

(3.562) © cu = 

(3.563) E an a ng} 

(3.564) ft = numi 

(3.565) $ usuy 
d u T ue 

(3-366) dio a 


provided that v ¥ O in (3.566) and that in (3.563) and (3.564) we have 
x Æ 0 and u # 0 when n is a negative integer and (except in some special 
cases) x > 0 and u > O when n is not an integer. 

The first three of these formulas enable us to obtain results like 


E (at — Bat + Sa? — Te + 6) = det — 9x? + 10x — 7 
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as rapidly as we can write. Thus scientists differentiate polynomials with 
gusto. Using (3.563) with n = —4 gives 
d 1 d 1 


= — yh = — = —% 


dx a/y dx 2 


when x > 0, and using it with n = % gives 


d d 3 
— = — 7% = = xh 
ZV an 5x 


when x > 0. 

The last formula (3.566) can be remembered for years with the aid of a 
little trick. We remember that the derivative of a quotient is a bigger 
and better one and begin by drawing a long line to separate the numerator 
from the denominator. We continue by putting v? in the denominator 
and then, while the v is in mind, begin the numerator by writing v. This 
starts things right, and the rest can be remembered. 

In our proof of the theorem, we fix (or select) an x in the domain of the 
functions and put u = u(x), o = o(x), u + Au = u(x + Ax), 


v + Ao = o(x + Ax) 


so that 
du ,. Au _ , u(x + Ax) — u(x) 
dx BM ie AM Ax 
do p» Aov_,. o(x + Ax) — r(x) 
de T DA As im, Ax 


We prove (3.561) and (3.562) together by starting with 
y = cu + cw, 


where c and c; are constants; we can put ¢ = ¢ı = 1 to get (3.561) and we 
can take cı = 0 to get (3.562). Then 


y + Ay = c(u + Au) + ¢1(0 + Av) 
and subtraction gives 


Hence 
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Therefore, an application of a theorem on limits gives 


dy 
Bachite i 


As has been remarked, putting c = cı = 1 gives (3.561) and putting 
cı = 0 gives (3.562). 

Postponing (3.563) and (3.564), we start proving the product formula 
(3.565) by setting y = uv. Then 


y + Ay 


(u + Au)(o + Av) 
= uy + u Av + v Au + Au Av, 


so Ay = u Av + v Au + Au Av. Dividing by Ax and inserting an extra 
factor Ax in the numerator and denominator of the last term give 


(3.57) Ay Ao Au Ao, 


dy — dv du do, 
dx “det +E Fo 


The last term is zero, and this proves (3.565). Proof of the quotient 
formula (3.566) is very similar, but the formula is important and we shall 
prove it. Lety = u/s. Then 


u + Au 
y + Ay = v + Av 
Ay = St Au _ x _vAu~ ube 
o + Av 9 g? + v Av 
jaw Ae 
(3.571) Ay Axr _ Ax 
Ax o? + o — Ax 


and this proves (3.566). 

The power formulas (3.563) and (3.564) remain to be proved, and we 
deal with (3.563) first. Lety = x”. In casen = 0, we have y = 1 and 
must prove that dy/dx = 0. This is true because if y = 1 for each x, 
then Ay = 0, so Ay/Ax = 0 and hence dy/dx = 0. In case n = 1, we 


3.5 Difference quotients and derivatives 157 


have y = x and must prove that 


This is true because if y = x,then Ay = Ax,so Ay/Ax = 1 and dy/dx = 1. 
The case = 2 is covered in the application under the four-step rule 
headline. There are several somewhat different ways to obtain (3.563) 
for greater integer values of n. Perhaps the most informative method 
consists of using the product formula (3.565) to obtain 


du, duz 


dus 
= = UU; + Uz u Uiu? = 
dx dy Us T Wile Fe 


and then putting u; = u: = u; = x to obtain 


dx” nl 
(3.572) Tn 7 1 


when n = 3. Another application of the same idea, in which the product 
Uiueuzu4 is written as the product (uiw2u3)u4 of two factors, gives 


duyuct3t4 du Usuru u dus uau uu dus ) UrUou dits 
Ogg + | ar 4 + 1U s— Ug + Uue ~~? 
dx dx dx ° dx =" 3 dx 


and putting uw; = uz = uz = u, = x gives the formula (3.572) for the 
case in which n = 4. The same procedure gives the result for greater 
integers. Perhaps the simplest proof can be based upon the fact that if 
(3.572) holds for a given n, then use of the product rule gives 


n+l 
dai d aa o gnat 4 ard = (nt Dar 


dx dx 


Since the formula is valid when n = 0, mathematical induction shows that 
it is valid when n% is a nonnegative integer. In case nisa negative integer 
(so that —n is a positive integer) and x # 0, the result is proved by the 
calculation 


dx» d 1 x*Q—1-(—n)xo™! © 


dx dx x7" xen 


which involves the formula for the derivative of a quotient. In case n is 
a constant which is not an integer, (3.563) is still valid at least when 
x > 0. Proof of this appears in Theorem 9.27, and proof of (3.564) then 
follows from the chain rule of Theorem 3.65. 
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Very much more about derivatives remains to be learned, and we give 
a modest but important contribution to theory by proving the following 
theorem. 


Theorem 3.58 If f’(x) exists, then f must be continuous at x. 
Our hypothesis and a theorem on limits enable us to write 


jim [fle + Ax) — -JON = jim [E = LO) a 


=| i m FEE AD PE |li aa = f'(x)-0 = 
aaO âz—> 
Therefore, 

lim f(x + Ax) = f(x), 


and it follows from this that f is continuous at x. 

The attainment of a technique for differentiating accurately and 
efficiently is of prime importance in the calculus. When we are called 
upon to evaluate the left member of the equation 


d x _ (Q+) — (2x) 1— # 


we should say to ourselves “the derivative with respect to x of u (mean- 
ing x) over v (meaning 1 + x?) is equal to the quotient with denominator 
o? [write (1 + x?)?] and numerator v [write (1 + x?)] times du/dx [write 1] 
minus u [write x] times dv/dx [write 2x]? We must learn to talk to our- 
selves in such a way that we can quickly produce such results as 


—n_ = 
ee EY 
— 


dx x x? “x 

d x (1 — x2) (1) — x(—2x) _ 1+? 
dx1l— x? (1 — x?)? (I — x2)? 
d 1 _0- 0 — 1(2x) _ 2x 

dxltx? (+x (Fx 


With the formula for the derivative of a product in mind, we obtain 


Z atr tH 2e 1) = G aH (Qe H 2) 
+ (x? + 2x + 1)(2x — 1) 


by saying “‘the derivative with respect to x of u (meaning x? — x + 1) 
times v (meaning x? + 2x + 1) is equal to u (write x? — x + 1) times 


do/dx (write 2x + 2) plus v (write x? + 2x + 1) times du/dx (write 
2x — 1).” 
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Problems 3.59 


1 Give the definition of the derivative of f atx. Ans.: (3.53) or (3.54). 
2 Find dy/dx from the definition of derivatives and then check the answers 
by use of formulas for differentiation when 


(a) y= Vx (b) y = v7 

Orsi D y= Ty5 
@y=p5 (Ny = 
Oy => t) y= 75 


3 Sometimes we are given a formula for f(x) and are required to find the 
derivative of f at a, where a is a number given in decimal form. In some cases 
it is easiest to find f'(a) directly from the formula 


or 


f(a) = lim f(a + Ax) — fla) f(a) = tim 262 + h) — f(a). 
4x20 Ax h~0 h 
In some cases it is easiest to work out a formula for f'(x) and put x = ain the 
result. Work the problem both ways when 


(a) f(x) = x? — 3x + 1, a = 416 
(b) f(x) = x2? — 3x4+1,2=0 


(c) f) =y l=? 


(2) fle) = E azo 


v4 Formulas for derivatives are often wonderful, but there are times when it 
is best to use the definition of derivatives to obtain f'(a). Letting g(x) = x|x|, 
find g’(0) or show that g’(0) does not exist. Ans.: 


g'(0) = lim g(h) 7 g(0) _ lim h= — 0, = 0. 


h—0 a0 
5 Supposing that 


=l] +r tetta’ 
= § + 4x — 3x? + 5x8 + x$, 


tell what facts or formulas or both enable us to write 


dy 
dx 
dz 


g T ET OH t 15x? + 4t. 


= 1 + 2x + 3x? + 4x? 
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6 Calculate 
d 


by use of the product formula. Then multiply the given factors and differentiate 
the result. Make the answers agree. Hint: Look at (x? + 3)(x? — 2) and read 
u (meaning x? + 3) times v (meaning x? — 2). Then apply the formula for the 
derivative of uv. 

7 This is another lesson on use of formulas. It is expected that persons 
studying calculus are familiar with the “quadratic formula.” When we want to 
find the values of x for which 


2x2 + 3x —4=0, 


we say “ax? + bx + c = 0” and, without writing anything, realize that we put 
a = 2 and b = 3 andc = —4 in the memorized formula 


—b+ Vb? — 4 ac 
2a 


—3 + V9 F 32. 
—— yp 


Then we write only 


x = 


When we use differentiation formulas, we should be equally efficient. When we 
must differentiate 


(1) y = (3x? + 1)5, 


we should realize that we must differentiate something of the form u” (not x”), 
where u is a function of x. The formula 


(2) < u” = nur! a 

should come into our minds but should not be written. We should look at (1) 
and read “y equals u to the nth power” and realize without writing anything 
that u = 3x? + 1 and n = 5. We should then say ‘“‘dy/dx equals n (write 5) 
u (write 3x2 + 1) to the power n — 1 (write 4) times du/dx (write 6x).? Thus 
we look at (1) and, after a little chat with ourselves, write 


(3) 2 = 5(3x? + 1)*-6x 


= 30x(3x? + 1)4. 
Minor modifications of this technique can be tolerated, but speed and accuracy 
must be developed. Write the formula (1) and practice differentiating it as a 


golfer practices putting; perfection is required. 
8 Look at the calculations 


y = (1 — x°)%, z= (1 + x?) 
dy 1 dz 


Bil), FE = (1 + 202) 


until you see where they come from and understand them thoroughly. Nothing 
is to be written. 
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9 Calculate 
d1l— x? d 
daif gT x) 


by the quotient formula and by the product formula. Make the results agree. 
10 [It can be observed that the sum of the first two of the expressions 


x? 1 
———_=_) —— 2 
Jeri yayi Vit 


is the third. Find the derivatives of these things and check the results by show- 
ing that the sum of the first two is the third. 
11 The formulas 


— sin x = cos x — cos x = — sin x 
dx ? dx 


will be proved in Section 8.1. Copy them on a nice clean piece of paper, and 
take a casual look at the calculations 


d sinx cos? x + sin? x 1 
— tan x = 3 = aL S Fes = sec? 
d dx cos x cos? x cos? x 
d cos x — sin? x — cos? x — 1 
— cot x = ooo CUM Fe T= ec 
d “ sin x sin? x sin? x 
d . 1 sinx 
sec x = £ (cos x)! = — (cos x)™?(— sin x) = = sec x tan x 
dx cos x COS x 
d —] cosx 
csc x = £ (in x) = — (sin x)~?(cos x) = =—— =—— = — csc x cot x. 
ax sin x sin x 


Then, with the calculations out of sight, try to reproduce them. 
12 Show that 
dax+b  ad— be 
dxcx +d (ex +d)? 


13 Supposing that n is a positive integer and x Æ 1, show how the identity 
(1) R O a a D a R fae = 


can be used to obtain the less elementary identity 


nant) — (n + lj)? +1 
(x — 1)? 
Multiply by x and differentiate again to obtain another identity. 


14 Calculate the coordinates of the points on the graph of y = f(x) at which 
f'(x) = 0 when 


(2) 1 + 2x + 3x2 +e. tnx l= 


(a) f(x) = x3 — 3x Ans.: (~1,2) and (1,—2) 
(b) f(x) = x3 — 3x +2 Ans.: (—1,4) and (1,0) 
(c) f(x) = 2x 7 3 Ans.: None 
(4) f(*) = 773 F = Ans.: (0,0) 


(e) f(x) = ns.: (—1,—4) and (1,3) 
1 r x? 
Cf) f(x) = ax? + bx + Ans.: (—b/2a, —(b? — 4ac\/40) 
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15 A long time ago it was discovered that if P is a polynomial, then between 
each pair of values of x for which P(x) = 0 there must be at least one value of x 
for which P’(~) = 0. For the special case in which 


P(x) = (œ — 1)@ — 2)(@ — 3), 


find the values of x for which P(x) = 0 and the values of x for which P’(x) = 0, 
and verify the statement about the zeros of P and P’. Remark: This matter will 
become quite unmysterious when we learn about the Rolle theorem. 

16 In connection with the definition of the derivative of a function f at a 
point x, we recognized the possibility that this derivative may fail to exist, 
To clarify this matter, we should know about the simplest example of a con- 
tinuous function f which is not everywhere differentiable. To investigate such 
matters, we should know about the right-hand derivative f(x) and the left-hand 
derivative f_(x) that are defined by the formulas 


f(x) = lim fe + Ax) — fe) 


f(x + Ax) — f(x) 
420+ Ax _ 


Az—0 Ax 


f(x) = lim 
when these limits exist. It is easy to guess and almost as easy to prove that f is 
differentiable at x if and only if 


fal) = f(x) = f'(x). 


For the simplest example in which f(x) = |x|, show that f}(0) = 1 and f(0) = —1 
and hence that f’(Q) does not exist. It is not so easy to construct a continuous 
function which is everywhere nondifferentiable, but Weierstrass (1815-1897) 
started this construction business a long time ago. 

17 Construct and look at a graph similar to those in Figures 3.46 and 3.47 
but having the loops tangent to the lines having equations y = x andy = —x. 
Letting this be the graph of f and, letting f(0) = 0, discuss continuity of f at 0 
and discuss f,(0), f_(0), and f’(0). 

18 For reasons that may be partially explained by the remark at the end of 
this set of problems, we give, in terms of the notation of Newton, a complete 
statement and proof of the part of Theorem 3.56 that involves the product 
formula (3.565). 


(1) Theorem If g and h are functions differentiable at x and if f is the func- 
tion for which 


(2) F(x) = g(%)h() 
when x belongs to the domains of g and h, then f is differentiable at x and 
(3) F(x) = g(x)h'(x) + h(x)g’(x). 
Proof: Since g and h are differentiable at x, there must be an interval J with 


center at x over which g and h are defined. When x + Ax lies in this interval, 
we have 


(4) fæ + Ax) = g(x + Ax)a(x + Ax) 
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and hence 
(5) F(x + Ax) — f(x) = g(x + Ax)h(x + Ax) — g(x)A(x). 


To make the right side more tractable, we subtract and add the term g(x + Ax)h(x) 
and then divide by Ax to obtain 


f(x + Ax) — f(x) _ 


o) EER TIO L gf tax) EEA = FO) 


+ A(x) gle + An) ~ ge) 


xX 


But the hypotheses of our theorem, the definition of derivative, and Theorem 
3.58 imply that 


m h(x + Ax) — h(x) 


(7) lim g + Ax) = g), lim AG = h'(x), 
_ g(x + Ax) — glx) _ , 
umo Ae TEO 


It follows that the limit, as Ax approaches zero, of the right member of (6) is the 
right member of the formula 


lim BEEE TAD L KK) + Ag). 


(8) 
The limit of the left member must be the same. Therefore, (8) holds, and (3) 
then follows from the definition of the derivative of f at x. This proves the 
theorem. This proof is essentially the same as the proof involving (3.57). If 
we set u = g(x), 9 = h(x), y = f(x), u + Au = g(x + Ax), v + Ao = A(x + Ax), 
and y + Ay = f(x + Ax), then (6) becomes 


Ay Av Au 
(9) Ay = H+ Au) + iy 


which is, except for a minor shuffling of terms, the same as (3.57). The version 
involving (3.57) is usually preferred in elementary courses because the formulas 
involving Au, Av, and Ay flow more smoothly and quickly than those given above. 

19 Remark: As was said in passing in the text, discussions of names and sym- 
bols can be long and perhaps dismal. We call a rose ‘‘a rose” because everyone 
else does, and we do not need another reason. We call the number 


(1) lim fie tao — Ke) 


Az—0 


when it exists, “the derivative of f at x”? because everyone else does, and we do 
not need another reason. We can denote this number by f'(x) because everyone 
else does, and we do not need another reason. If we want to know what f'(x) 
means, we do not look at f'(x); we look at the definition of f'(x) and see that 


oy a p, Je +A) =f), 
(2) fe) = lim “A 
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We can observe that the Newton notation in the formula (2) uses functiona] 
notation in a thoroughly standard way; if f is a function and x is a number, 
then the right side of (2), when it exists, determines the value of the function 
f’ atx. If, for example, f'(x) = 2x for each real x, then f’(0) = 0, f’(6) = 12, 
f' (x?) = 2x2, and f’(sin x) = 2sin x. It could be presumed that one good symbol 
for the number in (1) should be enough, but itis not enough. Even if there were 
no other reason, we would still be required to know another symbol in order to 
be able to read scientific literature and to converse with scientists. We must 
know that we can set y = f(x), so that in a particular case we have y = x?, and 


d 
we can denote the derivative of f at x by the symbol Se If we want to know 


d d vo d 

what Zy means, we do not look at 5. we look at the definition of Z? and find that, 
dx dx dx 

in the particular case, 


(3) = = 2x. 


According to the definition, 2 is the derivative of f at x, and the meaning of 


dy. or . 
is not changed when we read ‘‘dee y dee x” or “the derivative of y with respect 


to x” or even “the derivative of y with respect to x at x.” The assertion (3) 
always means that the derivative of f at x is 2x, and weird ways of reading the 
assertion do not change the meaning of the assertion. The meaning of the 
assertion is not changed when we realize that a silly result is obtained by sup- 
posing that the d’s and the x and the y in (3) are numbers and canceling the d’s 
to get y/x = 2x. The meaning of the assertion is still unchanged when we 
realize that we never put x = 6 in the two members of (3) to obtain 


d 
(4) Z = 12. 
We do, however, allow ourselves the liberty of writing 


(5) dx? d 


— = 2x or — x? = 2x 
dx 


to abbreviate the statement that if y = f(x), where f is the function for which 
f(x) = x?, then the derivative of f at x is 2x. From a logical point of view, every- 
thing we have done can be summarized very simply. If we want to know the 
meaning of the word “quibble,” we do not look at the word “‘quibble”’; we look 


os ee , d 
at a definition. Let us then quit quibbling about the meaning of > We can 


conclude with a cheerful remark. Whenever we are likely to encounter diff- 
culties with the Leibniz notation, we can discard it and use the Newton notation. 


3.6 The chain rule and differentiation of elementary functions 
To be able to illustrate methods by which fundamental formulas for 
derivatives are used, we suppose that we know the five fundamental 
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formulas 
d d 
(3.61) a = nxn), 7, Sin * = cos x, 2 cos x = — sin x 
qd... d 1 
(3.62) T = g”, ax log x = 7 


only the first of which has been partially proved.t In the last two of 
these formulas, the base is ¢, the base of natural exponentials and loga- 
rithms, which appears in (3.272) and which will appear later. One of our 
tasks is to learn a procedure by which we can obtain a correct formula 
for dy/dx when y = sin uw and u is a differentiable function of x which is 
not necessarily x itself. The answer is 


dy _ du. 
(3.621) dx = COS u dx 


To see why this is so, and to see how many similar formulas can be 
obtained, we consider the general situation in which y is a function of u 
and u is a function of x, say y = f(u) and u = g(x). Then y is linked tox 
through the links of a short chain; x determines u and u determines y, so y 
is a function of x. While the operation may seem somewhat ponderous 
when 


(3.622) y = o(x) = f(g(x)) = sin g(x) = sin u = sin 2x, 


we can let ¢(x) = f(g(x)) and sketch the schematic Figure 3.63 which 
catches the functions g, f, and 4, re- 
spectively, in the act of transforming 
(or mapping or carrying) x into u, u 
into y, and x into y. The function ¢ 
for which ¢(x) = f(g(«)) is sometimes 
called a composite function. Figure 3.63 
The following theorem is the chain 

rule, which sets forth conditions under which y has a derivative with 
respect to x that can be calculated from the chain formula 


dy _ dy du 
dx  dudx 


(3.64) 


The result is given in terms of the “d”? notation of Leibniz and the 
“prime” notation of Newton, so that we can, in applications, choose the 
one that seems to be most convenient or informative. 


t These formulas will be proved in Chapters 8 and 9. A contention that we can and 
should learn and use these formulas before they are proved is pedagogically sound. It is as 
practical as the contention that embryonic electrical engineers should learn that copper 
wires conduct electric current, and use this information in various ways, before they study 
solid-state physics and learn mechanisms by which electrons travel along conductors and 
semiconductors. 
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Theorem 3.65 (chain rule) If f and g are functions such that g is 
differentiable at x and f is differentiable at g(x) and if we set y = f(u) and 
u = g(x) so that y = f(g(x)), then the chain formula 


(3.66) Z= OS = fe’ @) = ENA 


is valid at x. 
To prove this theorem, we use the notation of the theorem to obtain 


(3.661) Au = g(x + Ax) — g(x), Ay = flu + Au) — f(u) 


and observe that u and y are determined by x alone, while Au and Ay are 
determined by x and Ax. Consider first the usual case in which there is a 
number 6, such that 6: > 0 and Au = 0 whenever 0 < |Ax| < 6;. Then, 
when 0 < |Ax| < 41, we can write 


(3.662) AY = AvAu _ flu + Au) — flu) g@ + Ax) — el) 


Ax AuAx Au Ax 
and, after observing that 
(3.663) lim Au = lim 4“ ax = 240 = 0, 
Az—0 Az—0 Ax dx 


take limits as Ax approaches zero to obtain the required result. Because 
division by zero is taboo, exceptional cases are more troublesome. We 
can avoid this difficulty and handle all cases at once by setting 


_ Ay _ f(u + Ax) — fl) 
(3.665) @(Au) = a = f(u) (Au = 0). 
Then, whether Au is zero or not, we can write 
Ay _ Au _ g(x + Ax) — g(x) 
(3.666) An” (Au) Ax > (Au) Tie 


and take limits as Ax approaches zero to obtain the required result. 

The basic elementary functions can be separated into three classes. 
The first class contains powers and roots of x, that is, functions of the 
form x*, where a is a constant. The second class contains the six trigo- 
nometric functions and the six inverse trigonometric functions. The 
third class contains exponential functions of the form 8? and logarithmic 
functions of the form log; x, the base b being a constant. Thus there are 
just 15 types of basic elementary functions. The class of elementary 
functions includes the frightful function @ having values 


(3.667) (x) = LEC F) + [e* + (xt — 7x? + sin™ 3x) 


sin g% + gsin3z 4. x sin? 4x — cos x5 
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and all others obtainable by making ‘“‘finite combinations’? of basic 
elementary functions together with addition, subtraction, multiplication, 
and division. This class contains very many important functions. It is 
therefore important to know that we can work out a formula for the 
derivative of any given elementary function when we know (i) Theorem 
3.56, (ii) 15 basic formulas for derivatives of basic elementary functions, 
(iii) the chain rule, and, in addition, we possess (iv) a technique which 
enables us to apply these things. 

Of the 15 basic formulas, the most important 5 were listed at the 
beginning of this section and are relisted in the first column of the follow- 
ing little table. 


(3.671) yn = nt! oye = nui tt 
(3.672) + sin x = cos x Tx sin u = cos ù Ta 
(3.673) td cos x = — sin x gcos v = o Sin ua 
(3.674) bene d peo ott 
3.675) Slogx == d ogy = 1m 


If we know the first formula on the left, we can set y = u” and use the 
chain formula (3.66) to obtain the chain formula 


written opposite it. If we know the second formula on the left, we can 
set y = sin u and use (3.66) again to obtain the chain formula 


written opposite it. The same procedure shows that each basic formula 
has a chain extension. Of the ten basic formulas not listed above, four 
(which appear in Problem 11 of Section 3.5 and have probably been for- 
gotten) give derivatives of the last four trigonometric functions, and the 
remaining six give derivatives of the inverse trigonometric functions. 
Proofs of all of the formulas will appear later. Except for three formulas 
that are rarely used, the formulas are listed on the page opposite the 
back cover of this book. 

Our fund of information about logarithms is quite meager, but we can 
slowly add to it. We begin with the idea that log x exists (as a real 
number) only when x > 0. In case x < 0, log x does not exist but 
|x| > O and log |x| does exist. When x < 0, we can use the chain formula 


168 Functions, limits, derivatives 


to obtain 
1 d-«) _ 1, 


d d 
T log |x| = Tn log (—x) = ss 


Thus we can extend the two formulas in (3.675) to obtain the more 
general formulas 


d l d 1 du 


in which it is required that x + 0 and u = 0 but it is not required that x 
and u be positive. 

Up to the present time, our work with difference quotients and deriva- 
tives has involved only fundamental definitions and formulas. Figures 
and geometric ideas, which might be helpful but which might also be mis- 
leading, have been completely ab- 
sent. Section 5.1 will present our 
thorough introduction to matters 
relating to slopes of graphs and 
tangents to graphs. Meanwhile, we 
may be helped and may be unmisled 
by looking at Figure 3.68, which 
shows the graph C of a differentiable 
function f. The points P and Q hav- 
ing coordinates (x,y) or (x, f(x)) and 
(x + Ax, y + Ay) or (x + Ax, f(x + Ax)) are shown, but the line PQ 
joining P and Qis not drawn. The first of the two formulas 


Figure 3.68 


Ay _ f(x + Ax) — f(x) _ . 
(3.681) an = Ac = slope of line PQ 
(3.682) a = f'(x) = slope of tangent to C at P 


= slope of C at P 


is correct because it is obtained by applying the definition of the slope of a 
line. The second formula is correct by definition; the line through P 
whose slope is the limit as Ax approaches zero of the slopes in (3.681) is, 
by definition, the line tangent to C at P, and, moreover, the slope of C at 
P is, by definition, the slope of the line tangent to C at P. The definition 
gives precision to the venerable idea that the slope of the line PQ is close 
to the slope of the tangent at P whenever Q is close to P. The definition 
(3.682) turns out to be very important. Indeed, there are many situ- 
ations in which magnitudes play minor roles and it is important to know 
that the graph C of y = f(x) has a horizontal tangent (tangent of zero 
slope) at each point (x,y) on C for which f'(x) = 0, has a tangent of 
positive slope at each point (x,y) on C for which f'(x) > 0, and has a 
tangent of negative slope at each point (x,y) on C for which f'(x) < 0. 
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Problems 3.69 


1 Calculate f’(xo) and write the equation of the line tangent to the graph of 
y = f(x) at the point (%0,0) when 


(a) f(x) = x?, xo = 1 Ans.: y — 1 = 2(x — 1) 
(b) fæ) =x(1— x)  xo=0 Ans: y =x 
(c) f(x) = e, xo = 0 Ans.cy=xt) 
(2) fe) ==] Ans.: y =} 


2 Become thoroughly familiar with the following technique, because it 
enables us to do many chores quickly and correctly. Suppose we are required 


to find dy/dx when 
(1) y = sin 2x. 


We must realize that we are not required to differentiate sin x but are required 
to differentiate sin «, where v is a function of x. We look at (1) and read “‘y 
equals sine u” and realize without making a lot of noise and without writing 
anything that u = 2x. We then write dy/dx and say this is equal to cos u 
(write cos 2x) times du/dx (write 2). When we follow orders, we get 


dy 
(2) J 7 (cos 2x)2, 
but it is always better to put the answer in the neater form 


(3) 2 = 2 cos 2x 


which does not require parentheses. 
3 Read the equations 


(a) y = (x? + 1) (b) y = cos e? 
(c) y = sin (ax + b) (d) y = e7 
(e) y = cos ax (f) y = log (x? + 1) 


the way we read them when we want to find dy/dx. In the first case, we can 
tolerate “y equals uw to the nth” as a contraction of ‘y equals u with the exponent 
n” or “y equals u to the nth power.” In another case, we can tolerate “‘y equals 
e to the u,” which looks bad in print but is universally understood. Now, sup- 
posing that n, a, and b are constants, concentrate upon the task of learning five 
basic formulas and applying them to obtain the answers 


(a) 2 = 2nx(x? + 1)? (b) 2 = —e7 sin e? 
(c) 2 = a cos (ax + b) (d) 2 = gett 

dy , dy 2x 
(e) 5 = 4 sin ax O = ati 


Practice the technique until the answers can be obtained quickly and effortlessly. 
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4 Each of the formulas 
y = x? sin x, y = xe? 


can be read “‘y equals u times v.” Do this and obtain the derivatives 


. d 
3 = x cos æ + 2x sin x, T = Het + e, 


5 Each of the formulas 


can be read “y equals u over v.” Do this and obtain the formulas 


-m = rr es | —- —_ Se 


6 Derivatives of derivatives are called derivatives of higher order; the deriva- 
tive of f at x is f'(x), the derivative of f’ at x is f’’(x), the derivative of f” at x is 
f” (x) or f(x), and so on. Supposing that zis a number and 


fle) = = aHa, 
show that 
P(e) = -@+ x), PE) = Wea tx) fO@) = -3a + x)-+4, 
FO) = AIG +x) FOG) = —Sletx)§ fO@) = 6a + x), 
where 2! = 1-2, 3! = 1-2-3, 4! = 1-2-3-4, etcetera. Supposing as usual that 
0! = 1 and 1! = 1, observe that 
FM) = (—1)*al(z + x)! (n = 0,1,2,» 
when we agree that the result of differentiating f zero times is f itself. 
7 Letting f(x) = (1 — 2x)-}, show that 
f(x) = 2n!(1 — 2x) (n = 0,1, 2, +: °). 
8 Letting f(x) = log (1 + x?), show that 
r= fe) =qyem 
and calculate one more derivative. 


9 The formulas 


(a) sin (a + b)x = sin ax cos bx + cos ax sin bx 
(b) cos (a + b)x = cos ax cos bx — sin ax sin bx 
(c) eat)z = eer ebz 

(d) log ax = log a + log x ` 

are permanently remembered by all good scientists. For each formula, calculate 


the derivatives with respect to x of the two sides and show that the results are 
equal. 
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10 Supposing that a and w (omega, to keep physicists and engineers happy) 
are constants and 
Q = e* sin wt, 
show how the formula 


dQ 


J; e (w cos wt) + (sin wtjae” 


e(w cos wi + a sin wt) 
is obtained. Then let J = dQ/dt and show that 
dI 


z= e#(2aw cos wt + (a? — w?) sin wil. 


Remark: It is not necessary for us to know that, if a < 0, Q might be the charge 
on the capacitor of an LRC oscillator, in which case the electric current would be 
I and the voltage drop across the inductor would be the product of dJ/di and the 
inductance L of the inductor. It is, however, a good idea to know that the things 
we are learning are important in applied mathematics. 


11 Prove that 
i (==): 
dx e? + ez Ne + eo 
12 If, for a positive integer n, 


sin x sin 2x . sin 3x sin nx 


y(t) =o Fo ttt 


show that 
yn (x) = cos x + cos 2x + cos 3x + °° © + cos nx. 


13 Calculate f'(x) from the first and then from the second of the formulas 


1— 
f(x) = log i) f(x) = log |1 — x| — log |1 + xl. 


Make the results agree. Hint: Do not forget the second formula in (3.676); 
the derivative with respect to x of log |u| is (1/u) du/dx and the absolute-value 
signs quietly disappear. 

14 Observe that if y is a differentiable function of x, so also is the function 
F having values 


F(x) = x? + xy(x) + Iy). 
Tell precisely what formulas are used to obtain the formula 


F'(x) = 2x + xy’(x) + y(x) + 2y(x)y (x) 
or 
Fy 2 dy 
ay REG TY tD 
Ans.: The power formula, the formula for the derivative of a product, the chain 
formula, and the formula for the derivative of a sum. 
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15 Supposing that y is a differentiable function of x for which 
x? + xy(x) + [y(x)]* = 3, 


apply our fundamental formulas for calculating derivatives to obtain the formula 


na 2x tyl), 
r) = — FF Bye) 


Hint: Equate the derivatives with respect to x of the two members of the given 
equation. Remark: This process, by which we start with an equation involving 
y(x) and [without obtaining an explicit formula for y(«)] obtain an explicit formula 
for y'(x), is called “implicit differentiation.” To gain understanding of this 
terminology, we can note that the formula y = x + 1 says explicitly that y is 
x + 1 while the equation y — x — 1 = 0 only implies, and hence says tmplicitly, 
that yisx +1. Itis sometimes said that the equation x? + y? = 1 determines y 
implicitly, but the fact is that the equation does not determine y. Saying that “y 
is either 1/1 — x? or ~—+/1 — x?’ does not determine y any more than saying 
“a blonde did it”’ determines the culprit in a whodonit. 
16 Write the first displayed formula of the preceding problem in the form 


x? + xy + y? = 3 
and use the Leibniz notation for derivatives to obtain the formula 


dy x+y, 
dx x«x+2y 


Observe, however, that the calculation is illusory unless y is a differentiable func- 
tion for which the given equation holds. 


17 Clarify matters relating to the two preceding problems by showing that 
y is a differentiable function satisfying the given equation if 


—y — +/ _ 2 
y(x) = ET VE = 2) (—2 <x <2) 
and also if 


y(x) = met vee -e (—2 <x < 2). 


18 A graph of the equation 


x* + xy + y? = 3 


appears in Figure 1.592. Find the equations of the tangents to this graph at the 
two points for which x = 0. Be sure to obtain results that agree with Figure 


1.592. 


19 It is not a simple matter to “‘solve”’ the equation 


(1) yi ty =x 


fory. Ify isa differentiable function of x for which the equation holds, however, 
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we can differentiate with respect to x with the aid of the chain rule to obtain 


d d 
(2) EAEL o Wye) + yx) = 1 
and hence 

dy _il , 1 
6 TFI YO) = GOT 


From (3) and the assumption that y is a differentiable function of x, we see that 
the derivative itself is a differentiable function of x. The derivative of the 
derivative is called the second derivative and is denoted by the symbols in the 
left members of the formulas 


dy = Z0 rı — — 6y 
(9) d GF 1 O) = F T 


By differentiating the members of (3), show that the formulas (4) are correct. 
20 Supposing that y is a differentiable function of x for which the given rela 
tion holds, differentiate with respect to x to find dy/dx when 


la) xy =7 Ans: 2 = -ž 
(b) sin y = x Ans: 2 = or + aS 
() vax Ans.: 2 = ors 
(d) snxy=xt+y dns: Z= — pas 


21 ‘Find f'(x) when 


(a) f(x) = log (x + Va? + x?) Ans.: = - 
(0) H) = log (a? Fx? — x) Ans: 5 L = 
(c) f(x) = (log sin 2x)? Ans.: ECS 2x log sin 2% 
(d) f(x) = log (sin 2x)? Ans. t cos 2x 
(e) f(x) = log sin (2x)? Ans.: Sx cos t 
A o = (1) Ans.: "E + a- ee 


22 The Hermite polynomials, depending upon a parameter a that is usually 
taken to be 1 or 2 in applications, are defined by the formulas Ho(x) = 1 and 


dr 
(1) H,(x) = (1)? T gant 2 (n 2 1). 
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Show that if (1) holds, then 


~az? = n E —az?] 
(2) H,(x)e~o74/? = (—1)" a antl? 
, deti z 
(3) [—axHq(x) + Hp) = (—1)* E eee 
, i, drt 
(4) axHy(x) — Hi (x) = (~1)*+¢027/2 ai 27a22/2 
and 
(5) Haii(x) = axH,(x) — H(z). 
Use (5) and the fact that Ho(x) = 1 to obtain the formulas 
Ho(x) =] 
H (x) = ax 


H(x) = ax? — a 

H3(x) = ax? — 3a% 

H,(x) = atx4 — 6a3x? + 3a? 

H;(x) = a5x5 — 10atx? + 15a% 

Ho(x) = ax — 15a5x4 + 45a4%? — 1528 
Hi(x) = atx? — 21a%&> + 105a5x3 — 105a4x. 


23 The Laguerre polynomials are defined by the formulas Lo(x) = 1 and 


L(x) = ee (x"e*) (n = 1,2,3, - - -). 
Show that 
L(x) =] 
Li(x) = ~x +1 
L(x) =x? — 4x +2 
L(x) = —x3 + 9x? — 18x + 6 
La(x) = x4 — 16x3 + 72x? — 96x + 24. 


24 Supposing that y = "2t or h(t) = e»t, use the chain rule and the formula 
for derivatives of products to obtain the first three derivatives with respect to 
t of these things. Ans.: 


dy 


(1) i h'(t) = eint cos ż 
d 
(2) “2 = h(t) = —etin? sin t + eist cos? t 
dèy Ir : . ° . 
(3) J ~ k” (t) = — eat cos t — 3eint cos ¢ sin t + eint cos? z. 


25 Assuming ezistence of all of the derivatives we want to use, show that if 
h(t) = f(g(@)), then 
(1) KE) = Fe E) 
(2) h) = FE O + F (e@)le’!? 


and write a formula for h’”(z). Then show that these formulas reduce to those 
of Problem 24 when f(x) = e and g(t) = sin £. 
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26 The preceding problem involved three functions and the Newton notation 
for derivatives. This problem requires use of the Leibniz notation. Supposing 
that ‘“‘y is a function of x and x is a function of ż so y is a function of #,”” and that 
each function has three or more derivatives, write formulas for the first three 
derivatives of y with respect toz. Finally, check your answers against those of 
the preceding two problems. Partial ans.: 


dy _ dy dx 
(1) dt dx dt 

dy __ ix | diy zy 
(2) di? © ded T dx®\ai 

dèy _ dy d*x d*y d°x dx dx 
(3) Je T ade + OTe a +2 a dt ak 


27 Suppose we momentarily agree that the first of the formulas 


dy _dydx dy én) , 
T RP T B (?) 


is true ““‘because’’ we get a correct result by canceling dx’s from the right side. 
Show that we should not apply the same “reasoning” to the second formula. 

28 Read Theorem 3.65 and observe that the hypotheses are satisfied if 
f(x) =1+% +x, g(x) = 0, and u = g(x) = 0 for each x, while y = f(u) for 
each u so that y = f(g(x)) = 1 foreach x. Hence the conclusion of the theorem 
implies that 


for each x. Observe that dy/dx = 0 and du/dx = 0 for each x. Our major 
question now appears. Is there a reason for uneasiness about the meaning of 
dy/du when u = Oforeach x? Remark and ans.: This question was raised by an 
extremely sane person who happened at the moment to be thinking too much 
about the manner in which we read dy/du and too little about the meaning of 
dy/du. According to our basic definition, dy/du is f’(u), the derivative of f at 
u. Since f(x) = 1 + x + x? for each x, we find that f'(x) = 1 + 2x for each x, 
so f'(u) = 1+ 2u for each u. Thus, dy/du = 1 + 2u. If it happens that 
u = 0 for each x, then dy/du = 1 for each x. We have no reason to be uneasy 
unless we manufacture trouble by recreating old tales about varying variables 
that we sometimes call galloping numbers. The notation of Leibniz is often 
more convenient than that of Newton, but it is also more likely to engender 
mental aberrations. Nobody expects u to be galloping around while we calculate 
f(x). 

29 Is the function f for which f(x) = |x| an elementary function? Remark 
and ans.: This is a tricky question. An intelligent perso d make an incor- 
rect guess until he discovers or is reminded that |x| = ral The function f is 
elementary, but /’(0) does not exist. 

30 Let p and q be positive integers. Let y(0) = 0 and let 


y(x) = xP sin = (x x 0). 
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Show that, when x = 0, 


_ 1 4. 1 
y'(x) = — grr! cos ~g + px? sin a 


Tell why this formula cannot be valid when x = 0. Then show with the aid of 

the sandwich theorem 3.287 that if p 2 2, then y’(0) = 0. Show that if p > 

q + 1thenlim y’(x) =0. Showthatifp £ g + 1, then lim y’(x) does not exist. 
x0 2— 


31 Before starting this problem, we make the profound observation that 0 
times a number is 0 but that nobody ever tries to define the product of 0 and 
something that does not exist. With this in mind, show that the first of the 
formulas 


(1) EA E EI 

is valid for each x and that the second is valid if, and only if, x #0. Hint: For 
the first part, observe that |x|? = x?. For the second part, consider separately 
the cases for which x > 0, x <0, and x =0. Remark: Putting f(x) = x? and 
g(x) = |x| shows that g'(x) can fail to exist even when it is known that df(g(x))/ 
dx exists. The calculations in 


d 
(2) p(x) = f(g)), pE) = Fel), 2 = ac 


may therefore be incorrect even when ¢ and f are both differentiable. In any 


case, we are not doing rigorous mathematics when we start with the first of the 
formulas 


(3) sin y = x, COs y 2 =] 


and obtain the second without giving a thought to the question whether y is a 
differentiable function of x. Congratulations can therefore be showered upon 
students who, at this time, have a healthy lack of enthusiasm for problems like 


Problem 20. 


3.7 Rates, velocities Let f be defined over some interval a S x < b 


—i- = 


and let y = f(x). When x and x + Ax both lie between a and b and 
Ax =Æ 0, the difference quotient in 


(3.71) A -fe + Az) — f(x) 


is the average rate of change of y with respect to x over the interval from the 
lesser to the greater of x and x + Ax. If this average rate (which is the 
difference quotient) has a limit as Ax approaches zero, then this limit 
[which is the derivative dy/dx or f'(x)] is the rate of change of y with respect 
to x at the given x. These are definitions which can, perhaps without 


3.7 Rates, velocities 177 


disastrous loss of meaning, be abbreviated to the forms 


(3.72) Average rate = difference quotient = Ay = fœ + 2x) — f(x) 


(3.73) Rate = derivative = 2 = f'(x). 


Of course, we are never required to prove definitions, but these are 
important and we must have or acquire an understanding of them and a 
feeling that they do (or do not) use words of the English language in a 
reasonably appropriate way. Shifting the letters from y and x to x and t, 
we see that the definition involving (3.72) shows that if x is a number of 
miles and ż is a number of hours, then the average rate of change of x 
with respect to ż is a number Ax of miles divided by a number At of hours 
and hence is a number of miles per hour. Some applications of this are 
very simple and agree with all primitive ideas about rates. When we are 
thinking about a particular automobile journey in which the automobile 
moves steadily in one direction along a straight road, we can let x and 
f(t) denote the distance (number of miles) traveled during the first ¢ hours 
of the trip. We are all accustomed to calculating the average rate over a 
given time interval and to calling this average rate an average speed. 
Suppose now that an untutored (but not necessarily stupid) individual is 
asked how he might, without looking at a perfect speedometer, determine 
a number Q which could reasonably be called the speed at a particular 
time ¢. His reply might be lengthy and partially intelligible. He 
should, sooner or later, arrive at the idea that the average speed over a 
long trip is likely to be a very bad approximation to Q, but that the aver- 
age speed over the time interval from ¢ to ¢ + At (or from ¢ + Az to żin 
case At < 0) should be near Q whenever Az is near 0 but Aż #0. We 
have learned how to make this idea precise. Itis done in the definitions 
we are discussing. Similar stories involving other rates (degrees centi- 
grade per centimeter, coulombs per second, and dollars per year, for 
examples) show that the definitions are sensible and should have swarms 
of important applications. 

Our simple discussion of the journey of an automobile moving steadily 
in one direction along a straight road involved the word “speed”? but 
carefully avoided the words “velocity” and “acceleration.” To appre- 
ciate what is coming, we should know some history. The words “speed,” 
“velocity,” and “acceleration” are very old. A long time ago, say before 
the year 1900, they were all numbers (scalars); velocity and acceleration 
could be negative but speed never could be. Nowadays, in all enlight- 
ened communities, velocities and accelerations are always vectors and 
we must learn about them. To get started, we consider the path traced 
by a bumblebee (or molecule or rocket or satellite or what not) as it 
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buzzes through space F3. While other tactics are both possible and use- 
ful, we suppose that we have a rectangular x, y, z coordinate system 
bearing unit vectors i, j, K as in Section 2.2. At each time ż, the coordi- 
nates of the bumblebee can be denoted by x(t), y(t), z(t). Letting r(z) 
denote the vector running from the origin to the bumblebee, we obtain 
the vector equation 


(3.74) r(t) = x()i + yj + 2@)K. 


This vector r(t) is called the displacement (or displacement vector) of the 
bumblebee at time t. Supposing that Az = 0, we can write 


(3.75) r(t + At) = x(t + Athi t+ y(t + At)j + z(t + Atk 
and form the difference quotient 


r(¢ + At) — r(t) _ a(t + At) — x(t) i+ y(t + At) — y(t) ; 


At At At 
z(t + At) — z(t) 
+ — a k 
which can be written in the abbreviated form 
Ar Ax, , Ay, , AZ 
(3.751) a Ait + Ais + ras 


In accordance with general terminology, this difference quotient is called 
the average rate of change of the vector r(#) with respect to ¢ over the 
interval from the lesser to the greater of t and t+ At. It is also called 
the average velocity of the bumblebee over this same interval. In case 
the above difference quotients have limits as At—> 0, the limit of the 


average velocity is called the velocity at time ¢ and is denoted by v(t). 
Thus 


(3.76) v@) =r =x Hi + yO + 2’ Ok 
Or 
_ dr _ dx dy. , dz 


provided the derivatives exist. Figures 3.762, 3.763, and 3.764 show how 
the vectors Ar, Ar/At, and v(t) might appear in a particular example. 


Figure 3.762 Figure 3.763 > Figure 3.764 


x(t+ Af) r(é+ Az) Ar 
At 


v(t) 
r(t) Ar r(t) r(t) 
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The scalar components of the velocity v or v(t) are sometimes denoted 
by the symbols vz, Vy, vz, so that 


! d P d 
(3.765) os = x'(1) - &, vy = y(t) = J = z(t) =F 
and 
(3.766) v = vÀ + oj + ok. 


The acceleration a(t) is a vector which is defined in terms of velocities 
in the same way that velocities are defined in terms of displacements. 
Thus, provided the derivatives exist, 


(3.77) a(t) = wt) = r” (t) = x” i + y” Oj + 2’Ok 


or 


ON dr _ ae, dy, day 
(3.771) a(t) =h 7 ie ~ ar + Je) j] + = 73 £ 


where the “‘double prime” in (3.77) and the number 2 appearing in “‘dee 
squared x dee ¢ squared” in (3.771) denote second derivatives, that is, 
derivatives of derivatives. We still have to learn what is meant by the 
speed (a scalar) of the bumblebee. It is defined by 


(3.78) Speed = length of velocity vector, 


so that, in our notation, 


(3.781) Speed = |v(z)| = \(2) + (2) 4 (4). 


Perhaps it should be explained that the ¢ appearing in the above equa- 
tions is called a parameter, that (3.74) is a parametric equation of the path, 
and that the path is the graph of the parametric equation. According to 
this definition, a parameter isa number. It is an element of the domain 
of the functions in (3.74), and we need not complicate our lives by harbor- 
ing impressions that parameters are complicated things. Section 7.1 
gives a careful explanation of circumstances in which the graph is called 
a curve. 

In Section 5.1 we shall give a rather detailed discussion of tangents to 
graphs. Meanwhile, it can be remarked that if the vectors in (3.751) 
and (3.761) are not 0, then the line through P(x,y,z) and P(x + Ax, 
y + Ay, z + Az) is called a chord of the curve being considered, and the 
line through P(x,y,z) having the direction of v(t) is called the tangent to 
the curve at P(x,y,z). Therefore, we can find the direction of the tangent 
to a sufficiently decent curve by finding the velocity of a particle which 
moves along the curve with nonzero velocity. The tangent line and the 
velocity vector have, by definition, the same direction. To bridge the 
gap between our work and plebeian terminology used in the prosaic 
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workaday world, we need still another definition. When we say that a 
moving body is, at time ż, “going in the direction of a vector W” we mean 
that its velocity v has the direction of w, that is, v = kW, where isa 
positive scalar. In case W is a unit vector, the scalar & is the speed of the 
body. It is not so easy to tell what the body is “doing”? when v = 0 or 
v does not exist. The ancient Greek philosophers tried to make people 
think about motion, and we never quite know how much they smiled 
when they insisted that an arrow cannot move where it is and cannot 
move where it isn’t and, hence, cannot move atall. Thoughts about such 
matters can bring the conviction that definitions are not superfluous. 

A few simple observations should be made. In case the bumblebee 
buzzes around in a plane which we take to be the xy plane, the above 
story is unchanged but calculations are simplified by the fact that z(t) = 0 
for each t. In case the bumblebee buzzes around in (or on) a line which 
we take to be the x axis, we have y(t) = z(t) = Ofor each ż. In modern 
terminology, scalars cannot be velocities but can be scalar components 
of velocities. In case a particle moves on a coordinate axis or on a line 
parallel to a coordinate axis, its velocity and acceleration are still vectors 
but their scalar components in the direction of the axis are scalars which 
we shall call the scalar velocity and scalar acceleration of the particle. 
For example, if a particle is moving on an x axis in such a way that its x 
coordinate at time ż is the scalar (or number) 


x= 48+ Be2+C:+ D + Esin at, 
then the scalar (or number) v (not v) defined by 


v = Z = 34 + B+ C + Eo cos wt 


is its scalar velocity at time ż and the scalar (or number) a (not a) defined 
by 

d*x , 

a= => = 64t + 2B — Ew? sin wt 

dt? 
is its scalar acceleration at time ż£. The speed is |dx/dt|. When the posi- 
tive x lies to the right of the origin, the particle is “moving to the right” 
at those times for which dx/dt > 0 and is “‘moving to the left” when 
dx/dt <0. It is not so easy to tell what the particle is “doing” when 
dx/dt = 0 or dx/dt does not exist. 


Problems 3.79 


I A stone is thrown downward 10 feet per second from the deck of a bridge. 
The distance s it will have fallen ¢ seconds later is assumed to be 


s = 10t + 1622. 
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Supposing that 0 < ¢ < ¢ + At, work out a formula for the average speed of the 
stone over the time interval from zto¢-+ Az. Then work out a formula for the 
speed at time z. Ans.: 10 + 322 + 16 At and 10 + 322. 

2 A vertical y axis has its positive part above the origin. A particle moves 
upon this axis in such a way that its coordinate at each time ¢ is 


y = —AP+ Bt +C, 
where 4 is a positive number. Show that the scalar velocity v is 
v= —24t + B, 


that the particle is going up when ż < B/24, and that the particle is going down 
when t > B/24. Show that the scalar acceleration is always —24. Show that 
the greatest height attained by the particle is B2/44 + C. 

3 In the context of the preceding problem, so determine 4, B, and C that 
the scalar acceleration is always — 32 and the particle is 3 units below the origin 
and going upward with speed 8 when ¢ = 0. 

4 A particle moves along the x axis in such a way that its x coordinate at 
time ż is 


x = 2t5 — 524 — 22 — 2+4 1. 
Find its scalar velocity and scalar acceleration at time t. Ans.: 
10:4 — 2023 — 4t — 2, 4023 — 6022 — 4. 


5 A body moves on a line in such a way that its coordinate x at time ż is 
{3 
w= a — 4% + 1st t 6. 


Find the time interval over which the scalar velocity is negative, and find the 
distance the body moves during the interval. Ans.: $. 
6 A particle moves along an x axis in such a way that, when ¢ 2 0, its 


coordinate is 
x = V kt? + 62, 


where k and ¢ are positive constants. Show that its speed is always less than 
k and approaches & as t becomes infinite. 

7 Ifan oak tree in Ohio was 20 feet tall when it was 15 years old and was 
36 feet tall when it was 25 years old, the average rate of change of height (meas- 
ured in feet) with respect to time (measured in years) over the 10-year interval 
is 1.6 feet per year. Tell why it is not reasonable to suppose that the tree grew 
steadily at the rate of 1.6 feet per year for 10 years. If the tree grew from height 
30 feet to height 32 feet in a calendar year from January 1 to December 31, sketch 
a reasonably realistic graph which shows how the height of the tree might depend 
upon ¢ during the year. 

8 The charge Q (measured in coulombs) on an electrical capacitor at time 
t is Qo sin wt, where Qo and w (omega) are constants. The rate of change of Q 
with respect to ¢ (measured in coulombs per second, that is, in amperes) 1s the 
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current J in the circuit containing the capacitor. Find a formula which gives / 
in terms of ¢. 

9 This problem involves uniform circular motion. Let a particle P start 
at the point (2,0) of the plane Figure 3.791 and move around the circle with 
center at the origin and radius a in such a way that 
the vector OP rotates at the constant positive rate 
w (omega) radians per second. Letting r = OP, 


show that 

(1) r= a(cos wii + sin wij) 
(2) v= aw(—sin wii + cos wij) 
(3) a = —aw*(cos wii + sin wij) 


and hence that 


Figure 3.791 


(4) a= —aw u, 


where, at each time ż, u is a unit vector running from the origin toward P. 
Show that r-v = 0 and interpret this result. Show that |v| = aw and interpret 
this result. Remark: The result (4) is important in physics. It says that, in 
uniform circular motion, the particle is always accelerated toward the center and 
that the magnitude of the acceleration is aw?. Some additional terminology 
should be encountered frequently and slowly absorbed. When a particle moves 
upon a line in such a way that its coordinate at time t is 4 + B sin (wt + 9), 
the motion is said to be sinusoidal or (particularly in old books) harmonic or 
simple harmonic. The numbers ¢, w/2r, and B are the phase, the frequency 
(cycles per unit time), and the amplitude of the motion. Glances at the compo- 
nents of r and a in the above formulas show that the projection of P upon a 
diameter (line, not number) of the circle executes sinusoidal motion. More- 
over, the projection is always accelerated toward the center, and the magnitude 
of the acceleration is proportional to the distance from the center. See also 
Problem 16. 

10 Asin the text of this section, let a particle move in E; in such a way that 
its displacement, velocity, and acceleration are 


(1) r) = xi + yj +2(@)k 
(2) ve) = xi +y Oj + 2’@k 
(3) alt) = x” 0i + y” Oj + z” (e)k 


and the square of its speed is 


(4) OL = KOR + fy’ @? + k'O. 


Using this information, prove that if the particle moves with constant speed c, 
then the acceleration is always orthogonal to the velocity. Hints: Do not get 
scared. Equate the right member of (4) to c?. Equate the derivatives of the 
members of your equation. Look at your result. Remark: One who thinks that 
this result is mysterious should remember or discover which way he tends to 


topple when he sits in an automobile which rounds an unbanked curve at con- 
stant speed. 
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11 This problem involves the uniform circular helical 
motion of a particle Q in £3 which runs up the helix (spiral 
staircase) of Figure 3.792 in such a way that its projection 
P upon the xy plane executes the uniform circular motion 
of Problem 9 whiles its z coordinate increases at the posi- 
tive rate b units per second. Supposing that Q occupies 
the position (4,0,0) when ¢ = 0 and lettingr = OQ, show 
that 


r= a(cos wii + sin wij) + btk 
v= aw(—sin wti + cos wij) + bk 
a = —aw*(cos wti + sin wij). Figure 3.792 


Find the speed of Q. Remark: One who gets interested in this helix may try 
to find the length of one turn by two different methods. First, use the speed of Q 
in an appropriate way. Second, find out what happens when the cylinder upon 
which the helix lies is cut along a vertical generator and rolled out flat. 

12 A projectile P moves in such a way that its displacement vector at time 
tis 7 
(1) r = (vo cos a)ti + [(v sin a)t — Fe2°]j, 


where a, vo, g are constants for which O < a < 1/2, vo >0,g>0. Show that 
its velocity at time ¢ is 


(2) v = [(v cos æji + (vo sin a)j] — géj. 
Show that a = —gj. Show that the coordinates x, y of P at time ¢ are 
(3) x = (a cos a)i, y = (v sin a)t — $gt?. 


Eliminate ¢ to obtain the equation 


~ so _ gx? 

(4) y = (tan a)x — zi oia 

7 rae Yq COS? a 
a4 


and note that the path of the projectile is a part of a parabola. Show that 
y = 0 when ¢ = 0 and that the projectile is then at the origin. Show that y = 0 
when ¢ = (2v sin a)/g and that the projectile is then at the point (R,0), where 


. Q . 
29% sin æ cosæ vo sin 2a 


£ £ 


(5) R 


This number R is called the range of the projectile, and this range is clearly a 
maximum when sin 2a = 1 and hence when 2a = 7/2 anda = 7/4. Show that 
the initial velocity (velocity at time ż = 0) is 


(6) (v9 cos æji + (vo sin œ)j 


and that this makes the angle a with the positive x axis. Show that the initial 
speed is ø. Tell, in terms of vectors, how the velocity at later times is related 
to the initial velocity. Find the velocity (not merely speed) of the projectile 
when it hits the point (R,0). 
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13 While the matter must remain mysterious until some mathematica] 
secrets have been revealed, the tip P of a cog of a particular hypocyclic gear 
moves in such a way that its displacement vector at time ¢ is 


r = a(cos? wti + sin? wtĵ), 
where a and w are positive constants. Find and simplify formulas for its velocity, 
speed, and acceleration. Ans.: 


v= oe sin 2wt(—cos wti + sin wtj) 


3 e 
Speed = > [sin 2we| 
2 
a= > sin 2wt(sin wti + cos wij) + 3aw? cos 2wt(—cos wti + sin wtj). 


14 A particle P moves in such a way that its displacement vector at time ż is 


22 , l, 
r=nyiitpry 


Show that |r| = 1 at all times and hence that the path of P must lie on the unit 
circle with center at the origin. Ifa particle moves on a circle in such a way that 
it has a nonzero velocity vector v, then v must be tangent to the circle and hence 
orthogonal (or perpendicular) tor. Check this story by calculating v and show- 
ing that vr = 0. Find the times at which the particle crosses the coordinate 
axes, and then obtain more information about the motion of P. 

15 Prove that if a particle P moves in Esin such a way that it has displace- 
ment vectors and velocity vectors r(z) and V(t) at time ż, then 


d r(t)-v(t) 
J Ir@| =- MOJ 


when the particle is not at the origin. Tell why this implies that if the path of P 
lies on a sphere with center at the origin, then v must be normal (or orthogonal 
or perpendicular) to r and hence v must be tangent to the sphere. Remark: 
It can be presumed that we do not known much about curves and surfaces in 
Es, but we can presume that if a particle P makes a decent trip along a decent 
curve lying on a decent surface, then at each time the velocity vector having 
its tail at P must be tangent to the surface as well as to the curve. 
16 Ifa particle moves along the x axis in such a way that, at time ż, 


x = asin (wt + ¢) 


where a, w, @ are constants for which a > 0 and w > 0, the particle is said to 
describe (or execute) sinusoidal (or harmonic) motion. Calculate the first and 
second derivatives of x with respect to ¢ and show that 


oe Gy? 
Remark: This shows that the scalar acceleration of the particle is proportional to 


the scalar displacement of the particle from the origin (or equilibrium position) 
about which it oscillates. 
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17 A particle P moves in £3 in such a way that the vector r running from the 
origin to P is 
r = r[sin 6 cos di + sin 0 sin j + cos Ok], 
where r, 0, and ¢ are all differentiable functions of t. Find the velocity and 
speed of the particle at time t. Ans.: 


r(t) = r (u + r(t)0’(t)v + r(t)’(£) sin Ow 


where u = sin 6 cos di + sin 0 sin j + cos 6k 
v = cos 8 cos di + cos @ sin ¢j — sin 6k 
— sin di + cos oj 


4 
Il 


and 


rOl = {ir + ree] + [rp E) sin 6]2}%. 


Remark: Problem 5 of Problems 2.69 shows that the vectors u, v, and W are 
orthonormal. The numbers r, ¢, and 0 are spherical coordinates which appear 
in Figure 10.12 and are studied in Chapter 10. When ais a positive number and 
r = a for each ż, P is always on a sphere and the above formulas become the 
standard formulas used for study of curves that lie on spheres. Chapter 7 gives 
solid information about curves. 

18 A spherical earth has its center at the origin and has radius a. A particle 
P moves on the surface S of the earth in such a way that the vector r running 
from the origin to P is 


(1) r = alsin 6 cos di + sin 0 sin @j + cos 6k] 


where 6 and ¢@ are differentiable functions of ¢ Show that the velocity of P 
at time ¢ is 


(2) r’(t) = a6’(t)v + ad’(t) sin Ow 
where 
(3) v = cos l cos di + cos @ sin ¢j — sin 0k 


(4) w = —sin di + cos @j. 


Remark: We invest a moment to look at some facts involving compass directions. 
When 0 < 6 < r so that P is neither at the north pole nor at the south pole, the 
vector V points south from P and the vector w points east from P. When 6 and 
@ are so related that, for some constant q, 


(5) $'(z) sin 6 = q0’ (6), 


the vector r’(t) and the path of P always make the same constant angle with w 
and hence always have the same compass direction. The path of P is then said 
to be a rhumb curve, or loxodrome. Such curves are followed by ships that keep 
sailing northeast. When we have learned more calculus, we will be able to show 
that (5) holds if and only if there is a constant ¢ for which 


l — cos 8 


(6) $ = q log — g t° or @ = q log (csc 0 — cot 0) + c 


or $ = q log tan $ +. 
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19 The vector formula 
r = (b + a cos 6)cos di + (b + a cos 6)sin dj + a sin 0k 


of Problem 22 at the end of Section 2.2 provides the possibility of studying curves 
on a torus. Supposing that 6 and ¢ are differentiable functions of ż, find r’(z). 
Ans.: 


r'’(t) = a6’(t)[— sin 8 cos gi — sin 0 sin j + cos 6k] 
+ (b + a cos 6)¢’(t)[— sin di + cos gj]. 


20 The rod OP of the linkage of Figure 3.793 has length a and has one end 
fixed at the origin O. The rod QP has length b. 
Its lower end moves to and fro on the x axis in 
such a way that its x coordinate is c + sin wt at 
time ¢. Its upper end is fastened to the first rod 
O e+sin wt Q at P, and the motion of Q causes the first rod to 
Figure 3.793 rotate. Write the formula (the law of cosines) 
which expresses 5? in terms of other quantities, and 
differentiate to obtain a formula for d0/dt. Then use the formula 


r = OP = a (cos 0i + sin 0j) 


to obtain a formula for the velocity v of P. Ans.: 


V = aw [a cos 0 — (e + sin wi)} cos wf ai {ES ai) cos et (—sin ĝi + cos 0j). 

21 A cam can furnish us something to differentiate. A circular disk of radius 
a is mounted on a cam shaft at O. 
Supposing that C is the center of the 
disk and that 0 < b < a, let |OC| = b. 
The eccentric disk rotates about O with 
constant angular speed, the angle POC 
being wi. The mechanism to the right 
of the disk in Figure 3.794 keeps the 

. oint P(x,0) of a rod pressed against 
Figure 3.794 the rotating disk so that P(x,0) moves 
to and fro on the x axis as the disk rotates. The formulas 


(1) OB = b cos wti, BC = b sin wij, |BP| = Va? — |BC|? 
show that 
(2) r = [b cos wt + vV a? — b? sin? wili, 


where r = OP = OB + BP. Find the velocity v and the acceleration a of P, 


and do not spend all day trying to discover the significance of the fact that (2) 
reduces to 


(3) r = alcos wt + |cos wil]i 


when b = a. 
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22 Let a particle of mass m move in a vertical plane in such a way that its 
coordinates x, y are differentiable functions of ¢ (time) and the vector r(t) running 
from the origin to the particle at time ¢ is 


rit) = x(Z)i + y). 
The kinetic energy £, of the particle can then be calculated from the formula 
E = $m|v(£)|2. 


Assuming that a constant gravitational force —mgj is exerted upon the body, 
we can calculate its gravitational potential energy Ez from the formula 


Eo = mgy(t). 
In appropriate circumstances, the number £ defined by 
E=E,+£; 


is the total energy of the particle. For present purposes we do not need basic 
information about these things, but we should know enough calculus to be able 
to calculate the total energy at time ¢ of a projectile of mass m for which 


r(t) = coi + (cut — Het?)j. 


Do it. Ans.: E = $m(ch + ct). Remark: The fact that E has the same value 
at all times is no surprise to persons who know about “‘conservation of energy.” 
23 Supposing that 


(1) T(t) = p(cos ġi + sin ¢j), 


where p and ¢ are functions of ¢ having two derivatives, differentiate to obtain 
r’(z) and then differentiate again to obtain r” (t). Ans.: 


d? ado \? 
D ro = [SP — o( GF) | (cos ei + sin gi) 


dpd d © ge 
+ [22 +o 5S] (sin i + cos gi). 


24 Give a full statement of reasons why it is true that if p and ¢ are functions 
of t such that p has one derivative and ¢ has two derivatives, then the product 


„d 
p? 2è has one derivative and 


d ,dġ_ „ dpdọ ,d% 
ne u Pana +? aE 


25 We can pick up assorted ideas by thinking about income tax rates. Let 
I(x) denote the income tax which T, a taxpayer in a particular class, must pay 
when his net taxable income is x. Of course x and I(x) are to be measured in 
appropriate units such as dollars or marks or kilobucks. A government may 
decree that if xı S x S xo, then I(x) is yı plus kı per cent of the excess of x over 


x; We can simplify this by letting mı = k:/100 and writing 


I(x) = yı + m(x — xı) (xı S x S x2). 
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Our knowledge of the natures of graphs of 
lines now shows that the graph of y = I(x) 
over the interval xı S x S x2 is a line seg- 
ment having slope mı. Figure 3.795 shows 
the graph of y = I(x) that can be constructed 
from information given in a possible tax 
table. Everybody says that tax rates are 
different in different tax brackets, being for 
example m, (or 100m, per cent) when 
Figure 3.795 x, <x <x“. Our fundamental problem js 

the following. Show that if xı < x < x, 
then our definition of a rate as a derivative is in agreement with the things that 
have been said about tax rates. Show that when x is xı, the tax rate does not 
exist but that the “right-hand rate” and “left-hand rate”? do exist. 


Xo Xi Xa Xg X4 


3.8 Related rates Useful information about derivatives and their 
applications can be gained by solving problems more or less like the fol- 
lowing one. Figure 3.81 represents a ladder which is 10 units (feet or 
meters) long. The top of the ladder rests against a vertical wall and is 
8 units above the horizontal floor upon which 

the bottom of the ladder rests 6 units from 

l the wall. It is supposed that the bottom of 

h the ladder is moving away from the wall at 
the rate of 2 units per second, and we are 
required to find the rate at which the 
invisible man at the top of the ladder is 
Figure 3.81 Figure 3.82 plunging earthward. To solve this problem, 
we begin by constructing the more propitious 

Figure 3.82 in which the ladder still has length 10 but x and y are varia- 
bles which (unlike 8 and 6) can have different values at different times ż 


when the ladder is skidding. The variables x and y are related by the 
formula 


(3.83) x? + y? = 100. 


In order to obtain a relation involving dx/dt (the rate at which the bottom 
of the ladder is moving away from the wall) and dy/dt (the rate at which 
our man is moving upward), we need the fundamental idea that we should 
consider x and y to be functions of ż and differentiate with respect to t. 


Equating the derivatives of the members of (3.83) and dividing by 2 
gives the formula 


dx dy _ 


which relates the related rates dx/dt and dy/dt. Putting x = 6, y = 8, 
and dx/dt = 2 shows that dy/dt = —3. This shows that our poor 
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man is rising —$ units per second and is therefore falling $ units per 
second, and our problem is solved. It can be insisted that our solution 
of the problem would have been more easily understood if we had used 
the more elaborate symbols x(t) or fı(t) and y(t) or f(t) instead of x and 
y to denote distances. Thus we could have written 


[~(2)]? + [y@]? = 100, x(z)x"(t) + yy) = 0 


AOP + F0] = 100, AOA +O = 0 


instead of (3.83) and (3.84). One who wishes to do so may insist that 
(3.83) and (3.84) abbreviate more meaningful formulas just as the sym- 
bols AA and AAA abbreviate Alcoholics Anonymous and American 
Automobile Association. It is, however, required that we learn the 
abbreviations to expedite our work and to enable us to understand others 
who use the abbreviations. 

When we are interested in problems involving rates of change of the 
volume V and the radius r of a sphere, we start with the formula 


(3.85) V = Grr or V(t) = $r[r(t)]?. 


Supposing that V and r are differentiable functions of time t, we can 
differentiate to obtain 


(3.851) — = hrr 


or 


When numerical values are assigned to two of the three quantities r, 
dr/dt, dV /dt, we can solve (3.851) for the remaining quantity. 

We need very little information about the external world to appreciate 
the idea that if an appropriate piston is pushed into a closed cylinder con- 
taining a gas, then the volume V of the confined gas will decrease and 
the pressure p exerted by the confined gas will increase. In appropriate 
circumstances, calculations can be based upon the formula 


(3.86) pV =c, 


where p and V are differentiable functions of ż and cis a constant. Dif- 
ferentiation with respect to ż gives the formula 


(3.861) pyro, 


which involves four numbers. When three of these numbers are known, 
we can calculate the remaining one. 

If a particle is moving along the graph of the equation y = x? in such 
a way that its coordinates x, y are differentiable functions of #, then 


(3.87) — = lx 
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When two of the three numbers in this formula are known, we can calcu- 
late the remaining one. Some more or less instructive problems involv- 
ing such motions require use of the formula 


(3.871) |v] = (#) + (2) 


for the speed of the particle. When motions in Æ; are to be investigated, 
it may be advantageous to use the vector formulas 


(3.88) r= xi + yj + ck 
_ dx, , dy, , dz 
(3.881) v=sit+ sit7k 


in which x, y, z represent the coordinates of the particle at time ¢t. Of 
course, this motion reduces to motion in the xy plane when z = Q for 
each t. 


Problems 3.89 


I. Asin Figure 3.891, a rope 13 feet long extends from a boat to a point on 
a dock 5 feet higher. A man on the dock is pulling rope in at the rate of 72 feet 
per minute. How fast is the boat moving? 
Ans.: 78 feet per minute. 

2 A light atop a pole is H feet above a level 
street. A man A feet tall walks steadily, F feet per 
second, along a line leading away from the base of 
the pole. At what rate is the tip of his shadow 
moving when he is x feet from the pole? 


Figure 3.891 


Ans.: 74 feet per second. 


8 We may be short on information about formation of raindrops in clouds, 
but we can study the growth of a spherical drop during the part of its develop- 
ment when, for some constant k, the rate, in cubic centimeters per second, 
at which it collects water is the product of % and the area of its surface. At 
what rate does the radius increase? Ans.: k centimeters per second. 

4 It is observed that the radii of volatile mothballs decrease at the rate of 
0.5 centimeter per year. Find the rate at which mothballstuff is evaporating 
from a collection of 100 mothballs of radius 0.6 centimeter. Ans.: About 226 
cubic centimeters per year or about 0.62 cubic centimeter per day. 

5 Sand is falling at the rate of 2 cubic feet per minute upon the tip of a 
conical sandpile which maintains the form of a right circular cone the height of 
which is always equal to the radius of the base. Sketch a figure and calculate the 
rate at which the height is increasing when the height is 6 feet. Ans.: 1/18m feet 
per minute. 

ó Thread is being unwound at the rate of 4 centimeters per second from 
an ordinary circular cylindrical spool of radius R centimeters. The unwound 
part of the thread has length s and is stretched into a line segment TE tangent 
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to the spool at the point 7. Find the rate of increase of the distance from the 
axis of the spool to the end £ of the thread. Hint: It is not necessary to make an 
extensive study of the path traversed by the end Æ; it is sufficient to construct 
and use an appropriate right triangle Ans.: As/~/R2+ s? centimeters per 
second. 

7 A particle is moving with constant speed & along the graph of y = sin x 
in such a way that its x coordinate is always increasing. Derive the formulas 


BY = cos x Z dx o È dy __kcosx _ 
di di dt VIF costx d y1 F cosx 


involving the scalar components of the velocity. Show also that 


d?x k? sin xcosx 


di? T (1 + cos? x)? 


8 A particle of mass m starts from rest (that is, starts with speed 0) at the 
point (xo,yo) of Figure 3.892 and, with the earth’s gravitational field pulling it 
downward, slides without friction on the graph of the equation y = x% We 
grasp an opportunity to see how basic scientific concepts can be employed to 
obtain information about the motion of the particle. 
When 0 S y < yo and the particle is at the point (x,y), 
the loss in potential energy is mg(yo — y) and the gain 
in kinetic energy is $ml|v|2, so 


|v]? = 2g(yo — y). 


With the aid of this information, derive the formulas 


dx\? _ 4, 20T (2) =8 g Ze — y) — y) , 


which determine (except for algebraic sign) the horizontal and vertical scalar 
components of the velocity of the particle when it is at the point (x,y). 
Remark: There is a reason why the formulas refuse to tell the signs of dx/dt and 
dy/dt. As time passes, the particle oscillates to and fro over an arc of the 
parabola in such a way that the scalar components of the velocity are sometimes 
positive and sometimes negative. 

9 Figure 3.893 shows a connecting rod of length b which earns its name by 
connecting a piston (which is free to 
move to and fro in a cylinder) to a 


Nils 7 V 
point P on a crankshaft which is free Piston 
to rotate in a circle of radius a having T 
its center at O0. We should not be too 7 
busy to observe that b exceeds 2a in Figure 3.893 


ordinary engines and pumps. Obtain 
a formula relating dx/dt, the scalar velocity of the piston, to d0/dt, the angular 
speed of the crankshaft. Hint: Use the law of cosines in the form 


62 = a? + x? — 2ax cos 6. 
dx ax sin @ dô 
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10 Let ¢ be the angle between the lines of Figure 3.893 that have lengths b 
and x. Show that 
dp _acos0d@ 
di bcos¢ dt 


Hint: At each time the numbers a sin 0 and b sin @ are equal to each other 
because they are both equal to the distance (or the negative of the distance) 
from P to the line having length x. Thus we use a slight extension of the trigo- 
nometric law of sines. 

11 A circle of radius R has its center at the point (0,R) of an xy plane. A 
motorcycle is racing at night along the circle in the first quadrant toward the 
origin. When the motorcycle is at distance s (measured along the circle) from 
the origin, its headlight illuminates a spot at the point (x,0) on the x axis. Show 
how the rate at which the spot is moving depends upon R, s, and the rate at 
which the motorcycle is moving. Outline of solution: Study of an appropriate 
figure can lead us to the first and then the second of the formulas 


(1) 3 = tan @, x = Rtan P 


where @ is the angle between the vectors running from the center of the circle 
to the origin and the point (x,0), and 


3 _ length ofarc _ s/2 __ s 
(2) ~ radis R 2R 
Since 
d _ dsing _ cos? ġ + sin? ddd _ » dO 
(3) ay r $ dt cos $ cos? @ di °° $ dt’ 


(4) Zaj s ds 


Remark: In the context of the motorcycle problem, ds/dt and dx/dt are both 
negative. The answer will give very interesting information to those who study 
it. Those who do not use the metric system for measuring distances and speeds 
may observe that if s and R are numbers of feet and ds/dt is a number of miles 
per hour (or furlongs per fortnight), then dx/dt will be a number of miles per hour 
(or furlongs per fortnight). 

12 A man ora boy or a particle is, for reasons that are sometimes explained, 
at the point Pi(x1,Y171) and is moving with speed q; in the direction of the unit 
vector ai + bj + cık. A second animate or inanimate object is at the point 
P2(x2,72,%2) and is moving or being moved with speed ge in the direction of the 
unit vector aå + baj + cok. We are required to find the rate at which the dis- 
tance between the two objects is changing. Do it. Solution: Let the bodies 


be at the points P(x,y,z) and Q(u,o,w) at timet. The distance between the bodies 
at time z is then the positive number s for which 


(1) s? = (u — x)? + (9 — y)? + (w — 2). 
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Even though we have written (1) in such a way that ż does not appear, we rise 
to the occasion and differentiate with respect to ¢ to obtain 


ds dx dv dy dw dz 
a) $= u- 0 (G-F)+0-» (2-2) + w-9(# -%) 


To obtain the answer, we determine s from (1) and then ds/dt from (2) when 


(3) u = X2 v= Yə, w = 22, x = Xi, y = yi, 2 = Zi 
du do dw dx dy 
(4) z” 242 ad gabe, di 4202s de T VIb d qıbı, 


dt T Deb 


Remark: The formula (2) is trying to tell us something. The vector PO in the 
formula 


(5) PO = (u — x)i + (w — y)j + (w — dk 


is the displacement of Q relative to P. The velocity v of Q relative to P is 
obtained by differentiating this with respect toz. Thus 


(6) v=($-G)i+ Z- ait a - F) E 


Hence (2) tells us that 


ds — 
(7) sJ” POV, 


that is, the left side is the scalar product of the displacement vector and the 
velocity of Q relative to P. Since IPO| = s, the vector POQ/s i is a unit vector in 
the direction of PO. The relations (2) and (7) therefore bear a simple message. 
They tell us that the rate of change of the distance between two bodies is the scalar 
component of the relative velocity of the bodies in the direction of the line joining the 
bodies. This agrees with and is perhaps even a consequence of the fact that if 
one body moves in a circle having its center at the other body, then the distance 
between the bodies is always the radius of the circle. 

13 Kitty was riding a horse on a merry-go-round of radius R. When she was 
south of the center pole and going east with speed s, she exuberantly threw a ball 
toward the pole. Kitty expected to hit the pole, but unfortunately Chester was 
riding gallantly ahead of her and the ball hit him on the chin when he was east of 
the center. Sketch a figure showing the east and north vector components of the 
velocity (relative to terra firma) of the ball and mark the place where Chester was 
sitting when Kitty threw the ball. 


3.9 Increments and differentials As we become educated, we pick 
up assorted ideas akin to the idea that we never try to find the weight 
(1 avoirdupois grain or 0.0648 gram) of a kernel of medieval wheat by 
finding the weight of a truckload of the stuff and subtracting the weight of 
the decreased load resulting from removal of the kernel. The difficulty 
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lies in the fact that small relative errors in the large weights can produce 
huge relative errors in the small difference. The obvious way to find 
the order of magnitude of the number Ay defined by 


(3.91) Ay = f(x + Ax) — f(x) 


is to calculate the terms on the right side and subtract. However, when 
the numbers Ax and Ay are very small in comparison to x and f(x) and 
f(x + Ax), this calculation can be thoroughly tedious and impractical. 
The following alternative way of estimating Ay is very often used. If 
f and x are such that f'(x) exists, we can define ¢(x,Ax) by the formula 


(3.92) a = Fle tAn) — 1) © (x) + (edz) 


and conclude that 


(3.921) lim $(x,Ax) = 0. 
Az—->0 


Multiplying (3.92) by Ax gives the formula 
(3.922) Ay = f'(x) Ax + $(x,Ax) Ax, 


which separates Ay into the sum of two “‘parts.” In case f'(x) = 0 and 
Ax is near 0, the ‘“‘part” ¢(x,Ax) Ax is small in comparison to the “part” 
f'(x) Ax and the number f'(x) Axis the “principal part” of Ay. ‘Therefore, 
when f'(x) = 0, we can write the formula 


(3.93) Ay ~ f'(x) Ax 


to mean that the numbers Ay and f'(x) Ax have the same order of mag- 
nitude when Ax is near Q, that is, 


lim Al = 

azo f (x) Ax 

In any case, it is a common practice to use the number f'(x) Ax as an 
approximation to Ay when f, x, and Ax are given and |Ax| is judged to 
be small enough to make the approximation useful. In some cases it is 
equally useful to use the number Ay/f’{x) as an approximation to Ax 
when f, x, and Ay are given. 

We have seen that, in appropriate circumstances, the numbers Ay 
(an increment of y) and Ax (an increment of x) are such that Ay and f'(x) Ax 
are nearly equal in the sense that their ratio is nearly 1. While it may be 
difficult to see why we should become excited about the matter, it is 
worthwhile to think about and even use pairs of numbers dy and dx for 
which dy is exactly (not merely approximately) equal to f'(x) dx so that 


(3.94) dy = f'(x) dx. 


3.9 Increments and differentials 195 


Such numbers dy and dx are called differentials, and some useful observa- 
tions can be made. When f and x are such that f'(x) exists, we can let 
dx be any number that pleases us and calculate dy, and, provided f'(x) = 
0, we can also let dy be any number we please and calculate dx. Our 
interest in differentials can start to develop when we see that, as Figure 
3.95 indicates, the point (x + dx, y + dy) must lie on the line tangent 
to the graph of f at the point (x,y). This is true because f'(x) is the 
slope of the tangent and (3.94) implies that dy/dx is this slope when 
dx #0. Since increments Ay and Ax are numbers such that the point 
(x + Ax, y + Ay) lies on the graph of f and differentials dx and dy are 
numbers such that the point (x + dx, y + dy) lies on the tangent to 
the graph at the point (x,y), it is clear that both of the two equalities 
dx = Ax and dy = Ay can be true only when the two points (x + Ax, 
y + Ay) and (x + dx, y + dy) coincide at a point of intersection of the 
graph and the tangent. 


(x+ Ax, y+ AY) O (x+ Ax, yt Ay) 


(x+ dx, y+ dy) 


Figure 3.95 Figure 3.951 


It is particularly easy to produce the differential formula (3.94) when 
we use the Leibniz notation for derivatives. The calculation in the left- 
hand column 


y = f(x) y = x? 
dy _ dy _ 
LT T Oe 


dy = f'(x) dx dy = 2x dx 


produces the formula whenever f and x are such that f'(x) exists, and the 
calculation in the second column shows how things go when f(x) = x’. 
This circumstance emphasizes the fact that, when f and x are such that 


f'(x) exists and a is as usual the derivative f'(x), the differentials dy 


d 
and dx are defined in such a way that the quotient 2 (dy divided by dx) 


is the same as the derivative a when dx * 0. To find the differential 


formula relating dy and dx when f and x are given, it is therefore sufficient 
to set y = f(x), differentiate to obtain the formula 


(3.952) a = f'(x), 
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and then multiply by dx. A little experience with these things makes us 
realize that if y = sin x, we can write the formula dy = cos x dx without 
bothering to write the intermediate step dy/dx = cos x. 

In most situations where increments Ay, Ax and differentials dy, dx 
simultaneously appear, it is convenient to suppose that dx = Ax. In 
such cases, glances at figures more or less like Figure 3.951 can fortify 
the idea that Ay can easily be twice dy when Ax and dx are equal but 
not small, but that Ay and dy must have the same order of magnitude 
when f'(x) = 0 and the equal numbers Ax and dx are near 0. Thus a 
useful cookbook modus operandi runs as follows. 

When f, x, and Ax are given such that f'(x) = 0 and we want an approxi- 
mation to the number Ay defined by 


(3.96) Ay = f(x + Ax) — f(x), 
we put Ax = dx, calculate the number (or differential) dy defined by 
(3.961) dy = f'(x) dx, 


and use dy as an approximation to Ay. 

It is instructive to consider a thoroughly simple example in which all 
of the details are easily understood. Letting x and Ax be numbers which 
could be 38.27 and 0.05, we can determine the increment Ay in the area 
of a square when the lengths of its sides are increased from x to x + Ax. 
Letting y = x? and y + Ay = (x + Ax)’, we find that 


(3.97) Ay = (x + Ax)? — x? = 2x Ax + Ax? 


When Ax > 0, the number 2x Ax is the sum of the areas of the two 
rectangles of Figure 3.971 which have dimensions x and Ax. The number 


Ax | o 


Ax 
Figure 3.971 


Ax? is the area of the smaller square in the upper right-hand corner of 
the figure. The differential dy is, when dx = Ax, 


(3.972) dy = 2x dx = 2x Ax, 


and it is easily seen that this is a good approximation to Ay when Ax is 
small in comparison to x. 

It is sometimes convenient to solve problems more or less like the fol- 
lowing one in order to determine the accuracy of measurements required 
to produce required accuracy of results computed from the measurements. 
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If we measure the edges of a cube and decide that, subject to errors in 
measurement, each side has length x, we conclude that, subject to con- 
sequences of errors in measurement, the volume V of the cube is x3. 
If the edges have exact lengths x + Ax, then the exact volume is (x + Ax)%, 
or Y + AV, and the number AVP is the error in V produced by the error 
Ax in x. In quantitative treatments of this matter, we let Y = x? and 


use the differential dV defined by 
(3.98) dV = 3x? dx 


as an approximation to AX. In some practical situations, it is reasonable 
to assume that (for some positive constant p that might be 4 or 2 or 10), 
the error dx in x has a magnitude not exceeding per cent of x. This 
means that 


(3.981) Idx] < Ee |x]. 
With this assumption, we find from (3.981) that 


Ê o 3? 3 _ 3? 
(3.982) aP] < 3x? Few = at = TE, 


This leads us to the idea that if we measure the length of the edge of a 
cube with an error not exceeding p per cent, the resulting error in the 
computed volume will not exceed 3 per cent. 


Problems 3.99 


I Find the increment A< and the differential d4 of area produced when a 
circular disk of radius 7 is expanded or contracted to a circular disk of radius 
r+ h. Ans.: AA = w(2hr + h?) and dd = 2rrh. Remark: We have another 
opportunity to try to understand a formula. Sketch a figure in which |A| is 
small in comparison to r and observe that the difference of the two disks is a 
circular ring of thickness |4|. Since the inner (or outer) boundary of this ring 
has length 277, it is not surprising that the area of the ring is approximately 
2ar\ hI. 

2 The area 4 of a sphere of radius 7 in £3 is 4rr?; this should seem to be 
reasonable because the area of a hemisphere should be about twice the area of an 
equatorial disk. The volume V of the spherical ball bounded by this sphere is 
g7r8, Find the increment AV and the differential dV of volume produced when 
the radius changes from r tor + h. Show that the formula for dV can be put 
in the form dV = Ah and try to see a geometrical reason why AA should be a 
good approximation to AV when |A| is small. 

3 Use differentials to obtain an approximation to the number of cubic 
centimeters of chromium plate that must be applied to the lateral surface of a 
circular cylindrical rod 30 centimeters long to increase its radius from 2.34 
centimeters to 2.35 centimeters. Ans.: About 4.4 cubic centimeters. 
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4 Suppose that x and y are differentiable functions of ¢ such that 


(1) x+y = 1, 

Show that differentiating with respect to and multiplying by dt gives the formula 

(2) x dx + ydy = 0. 

Remark: In case t = x, we can divide (2) by dx and recover the first of the formulas 
dy dx _ 

(3) x+y a = 0, sh ty =O 


which is valid when y is a differentiable function of x for which (1) holds. In 
case? = y, we can divide (2) by dy and recover the second formula in (3), which 
is valid when x is a differentiable function of y for which (1) holds. 

5 Supposing that n is a constant and x is positive, observe that the first 
of the formulas 


(1) y = x", log y = n log x 
is equivalent to the second. Use these formulas to obtain 
(2) dy = nx! dx 
and 
1 1 
(3) 7 dy =n x ax. 


Use (2) to show that if |dx| S$ (p/100)x, then |dy| S (|n|p/100)y. Then use (3) 

to obtain the same result. 
6 Gradual changes in tensions or compressions or temperatures can produce 
gradual changes in the lengths of iron rods that form a triangle such as that 
shown in Figure 3.991. Therefore, because the law of cosines 


v Á „ must hold at each time #, it makes sense to suppose that we 
| 


have four positive differentiable functions of ¢ such that 
u 


, (1) w? = u? + 2? — 2wv cos 6. 
Figure 3.991 
Equate the derivatives with respect to ¢ of the members of (1) 


and multiply by dz to obtain the differential formula 
(2) w dw = (u — v cos 8) du + (v — u cos 6) dv + uv sin 6 dé. 


Remark: It is sometimes both possible and unwise to underestimate potentialities 
of formulas. The formulas (1) and (2) contain eight numbers u, v, w, 6, du, do, 
dw, and d@. There are many situations in which some information about some 
of these numbers is known and the two formulas can be used to eke out more 
information. Some problems are much more recondite than the one solved by 
finding w from (1) and then finding dw from (2) when the values of the other six 
numbers are known. 


7 Supposing that T, L, and g are positive, observe that the first of the 
formulas 


L 
(1) T= m fE T = Ant log T = log 2r +5 log L — 5 log g 
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is equivalent to the other two. Use these formulas to obtain 


—}4 — 
4 g 
(3) ITdT = grr- L dg 
g 
1 11 11 
(4) pat = 77 dl — 55 de. 


Making a suitable application of the fact that |4| < |B] + |C| whenever 4, B, 
and C are numbers (not necessarily positive) for which 4 = B+ Cor4 = B—C, 
use (2) to show that if |dZ| < (p/100)L and |dg| S (¢/100)g, then |d7| S 
[$(p + g)/100]T. Repeat the process by use of (3) and (4). Remark: The 
first of the formulas (1) is a standard formula for the period T (a number of 
seconds) of a pendulum of length L which oscillates in a world where the scalar 
acceleration of gravity is g. Our result shows that if errors in measurement of L 
and g do not exceed p and g per cent, respectively, then the error in T will not 
exceed $(p + g) per cent. 

8 A pendulum clock gains 3 minutes in 24 hours. By what per cent should 
the pendulum be lengthened? Ans.: 0.42 per cent. 

9 Under appropriate conditions the pressure p and the volume V of confined 
gas satisfy the relation 


(*) pV =C, 


where y (gamma) and C are constants that depend upon the gas and the condi- 
tions. Obtain the formula 


dp ydy 
** tE p te _ 
(**) F + 77 0 


in two different ways. First, differentiate the members of (*) as they stand. 
Then operate upon the equation obtained by taking logarithms of the members 
of (*). Remark: It is so often desirable to take logarithms before differentiating 
that the process is named logarithmic differentiation. The derivative of the 
logarithm of a function is called the logarithmic derivative of the function. 

10 Apply the procedure of the preceding problem to the relation 


pF = nRT, 


in which v is the number of gram-moles of a gas, R is a universal proportionality 
constant known as “the gas constant,” and T is the absolute, or Kelvin, tempera- 
ture. It is now supposed that p, V, and T are all functions of ¢ and the relation 


is to be obtained. 
11 For dense projectiles fired short distances over a horizontal plane, the 
range R is calculated from the formula 


2 
To . 
R = —sin 2a, 


200 Functions, limits, derivatives 


where g is the (scalar) acceleration of gravity, vo is the initial speed, and æa is the 
angle of elevation of the gunso0 < œ < r/2. Find a formula in which we can 
put estimates of errors in g, %, and @ to obtain an estimate of the resulting error 
in R. Hint: Use logarithms. Ans.: 

dR 
R 


d Do 
Vo 


<2 


— 


dg 2a cos 2a | | da 
+ |S [+ |e |S) 
Remark: When a@ is not too close to 7/2, the factor multiplying |da/a| has the 
order of magnitude of [cos 2a]. When @ is near r/2, the factor is very large. 
We are now able to enlighten rabbit hunters when they ask us why they are 
unlikely to hit their targets when they shoot almost straight up. 

12 Find the maximum possible percentage of error in the computed estimate 
of the volume of a cone that can be caused by errors not exceeding p per cent and 
q per cent in measurements of the height and the base radius of the cone. Ans.: 
p + 2q. 

13 When three resistors having resistances 71, re, 73 are connected in parallel, 
the resulting resistance R is determined by the formula 


With the aid of the fact that resistances are always positive, prove that if no 
error in a resistor exceeds p per cent, then the error in R produced by these errors 
cannot exceed p per cent. Remark: This conclusion really means something to 
those who design the mazes hidden in our television sets. Problems involving 
silver bands and gold bands and tolerances (percentages of error) cannot be 
ignored. Engineers do not like to behave like rabbit hunters who shoot almost 
straight up. 

14 Show that the conclusion of the preceding problem is violently false if the 
numbers 7), 72, 73 are not resistances of resistors but are numbers of which some 
can be positive and some can be negative. 

15 Let r be a differentiable function of ¢. For example, we may have 


(1) r(t) = x(@¢)i + y@)j + 2@)k, 

where all of the functions are differentiable and the vectors i, j, k are the unit 
orthonormal vectors of Section 2.2. By a definition analogous to the one involv- 
ing scalar differentials, a vector dr and a scalar di constitute a pair of differentials 
if 

(2) . dr = r’(2) dt. 

With or without using the rectangular representation (1) and the fact that 


r@©? = e) + b@OrP+ l), 
prove that 


d|r(z)|? = 2r(z)-dr(z). 


16 The specific heat of a substance is sometimes said to be the number of 
calories of heat required to raise the temperature of 1 gram of the stuff 1 degree 
centigrade. This definition is sometimes useful but, because substances have 
different specific heats at different temperatures, the following definition is much 
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better. The specific heat o (sigma) of a substance at temperature x is Q’(x), 
where Q(x) is the number of calories of heat required to raise (or lower) the temperature 
of 1 gram of the stuf from O°C to x°C. For study of this matter, let ¢* be the 
specific heat calculated from the first definition so that 


ay) ot = PEF =O) gL Cy) = lim LE tH = OW). 
l h-0 h 


We can see one of the reasons why knowledge of calculus is needed for study of 
physical chemistry when we see that o* is a difference quotient and ø is a deriva- 
tive. Since the graph of Q is never (or not ordinarily) a line, ¢* and are usually 
different. The schematic Figure 3.992 illustrates one situation. As is easily 


> Q(x+1) 
Q(x) 


C g 


x x+1 


Figure 3.992 


imagined, there are situations in which the graph of Q is “almost straight” over 
the interval from x to x + 1 and o* is a “good approximation” to ø. On the 
other hand, o* can be dust and ashes when the interval from x tox + 1 straddles 
a temperature at which a substance changes from a solid state to a liquid or from 
a liquid to a gas. 

17 There are reasons why we should conclude with a historical remark. In 
the good old days when the ‘“‘doctrine of limits” was based upon visions of gal- 
loping numbers and the “‘infinitely small infinitesimals” were considered to be 
almost the most wonderful products of human thought, differentials were con- 
sidered to be the most wonderful. Differentials were the important things, and 
the things that we now call derivatives were merely the “differential coefficients” 
appearing in formulas like dy = f'(x) dx or dy = 2x dx. Thus differentials have 
their origin in old mathematics; it was the fashion to consider them to be “infi- 
nitely small’? but not quite zero. When at long last the concept of the ‘‘infi- 
nitely small”? was becoming obsolete, attempts were made to salvage differentials 
by promoting the idea that they really are not ordinary numbers at all but are 
numbers that are in the process of approaching zero.f So far as this course is 
concerned, the details of this remark are unimportant. We should, however, 
know that differentials have a long and checkered history and that we may 
expect to encounter some quite strange concepts as we get around in the world. 


t For those who have not peered into old books and consider this to be too incredible to be 
true, we quote three passages from W. E. Byerly, “Elements of the Differential Calculus,” 
Ginn and Heath, Boston, 1879. Page 149 tells us that ‘“4n infinitesimal or infinitely small 
quantity is a variable which is supposed to decrease indefinitely; in other words, it is a variable 
which approaches the limit zero.” Page 185 tells us that “‘It is to be noted that a differential 
is an infinitesimal, and that it differs from an infinitesimal increment by an infinitesimal of a 
higher order.” Page 186 tells us that “there is a practical advantage... in regarding the 
differential as the main thing, and looking at the derivative as the quotient of two differentials.” 
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4.1 Indefinite integrals There are about as many different types of 
integrals in mathematics as there are elements in chemistry, but only a 
few of them occur in first courses in calculus. This chapter introduces 
basic ideas about two kinds of integrals. These ideas may not be coming 
too soon to meet the needs of students taking other courses in which 
mathematics appears. In this section, and in some other places where 
the deviation from complete linguistic rectitude does not create deceptive 
statements, we sometimes refer to “the function f(x)” or to “the function 
having values f(x)” instead of to the function f which, for each x in some 
interval, has the value f(x). 

It is very often true that we have a given function f(x) and we are 
interested in those functions F(x) or y or y(x) (if any) for which 


(4.11) P(e) =f@) or B= fe) 


when x lies in some interval. Before discussing this situation, we intro- 


duce notation that is universally used. In case F(x) or y is a function for 
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which (4.11) holds, we represent it by the ingenious symbol in the formula 
(4.111) F(x) = [f(x)dx or y= Sf(x) dx. 


The second equation is read “y equals an integral of eff of x dee x.” 
We should all know that it can be read “y equals an indefinite integral 
of eff of x dee x,” or “y equals a function whose derivative with respect 
to x is f? or “y equals an antiderivative with respect to x of f,” but 
simplicity always prevails and we read what we see and say what is to 
be written. The integral sign f is an elongated S, the f(x) is called the 
integrand, and the dx tells us that derivatives with respect to x are 
involved.f ‘This matter turns out to be so important that we must con- 
tinually remember the following definition. 
Definition 4.12 The indefinite integral in the formula 


(4.121) y = Sf(x) dx = $(x) 


is (if it exists) a function of x whose derivative is the integrand f(x); in other 
words, the formula 


(4.122) 2 = f(x) = (x) 


and the formula (4.121) are both true or both false. 
For an example, let us see what we know and can learn about the 
functions y for which the equivalent formulas 


(4.123) = = 2x, y = f 2x dx 


are valid. We may remember that we differentiated x? and got 2x. 
Hence a function y for which the formulas are valid might be x? but it 
does not have to be because y might be x2 + 1 or x? — 5 or x? + 416. 
It can be proved that a given function y will satisfy the equivalent 
formulas (4.123) if and only if there is a constant c such that y = x? + c. 
Thus 


(4.124) fx dx = x? +e. 


To be precise about this matter, we state the following theorem which 
will be proved later in a remark following the proof of Theorem 5.57. 


tł Perhaps ıt should be emphasized at once that the dx in the symbol is not a number. If 
we resist temptations to jump to the conclusion that the dx and the crossbar on the f and 
the integral sign are numbers, we overcome a difficulty that makes some people feel that the 
good old symbol should be abandoned in favor of another which provides fewer temptations. 
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Theorem 4.13 If two functions y and F have the same derivative over an 
interval, then there is a constant c such that 


y(x) = F(x) +c 


for each x 1n the interval. 
Considerable information is packed into the little formula 
; 


(4.14) y = Jf(x) dx = F(x) + c 


in which F(x) is any one particular function whose derivative with 
respect to x is, over some interval, the integrand f(x) of the integral 
It tells us that, whatever the value of the constant c may be, F(x) +, 
is a function y whose derivative with respect to x is the integrand f(x), 
Moreover, it tells us that if y is a function whose derivative is the inte- 
grand, then there must be a constant c for which y = F(x) + c. The 
full meaning of the assertion (4.14) has been stated, and this is what js 
important. Simply because we must converse with our fellow men and 
must read scientific writings, we must join our fellow men in learning 
some terminology. The constant cin (4.14) is a “constant of integration” 
and the poor fellow is sometimes said to be “arbitrary.’? The integral 
is called an “indefinite integral” to distinguish it from other types of 
integrals that are sometimes called ‘‘definite integrals.” This rather 
strange terminology will not injure us if we do not allow it to interfere 
with our understanding of the meaning of (4.14). The assertion “each 
indefinite integral of f is the sum of a particular indefinite integral and 
a constant of integration” sounds weird but is true. The “meaning” 
of the word “indefinite” can be understood if we realize that when c is a 
constant, say 416, F(x) + c is an “indefinite integral” of f(x) just as 
the mayor of Chicago is an “indefinite citizen” of Chicago. 


In case F’(x) = f(x), G’(x) = g(x), and a, b are constants, differentia- 
tion formulas show that 


(4.15) Slaf~) + bg(x)] dx = aF(x) + bG(x) + c 
and 
(4.16) Slaf(x) + bg(x)] dx = aff(x) dx + bfg(x) dx. 


These formulas tell us that “integrals of sums are sums of integrals” and 
that “constants can be moved across integral signs.” The formulas do 
not provide justification for moving functions across integral signs; other- 
wise, we could replace = by = in the formula 


(4.161) JF) dx A fla)Sdx = f(x)(~ + c) 


and eliminate all of our troubles. 
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The following little table gives two versions of each of the five simplest 
and most useful integration formulas. 


a _ ntl „du B ynth 
(4171) fa ax te Jefi +e 
(4.172) | sin eax = —cosx +e [ sinu $e = — cosu +c 
(4.173) | cos # dx = sin +c [ osude = sinue 


(4.174) [earnete T de= ete 


1 
(4.175) J; dx = log |x| + ¢ ar dx = log |u| + c 


In the formulas of the second column, u is supposed to be a differentiable 
function of x. Subject to the requirement that n = —1 in (4.171), and 
that x and u are confined to intervals over which the integrands in (4.171) 
and (4.175) exist, these formulas are proved by observing that they have 
the form (4.14) where F’(x) = f(x). We need not learn all of the formulas 
we see, but the formulas in the above table are used so often that they 
must be learned. 

When the formulas in the column on the right are being used, presence 
of the factor du/dx must be carefully observed. It is not correct to 
think of u as being sin x and to claim that use of (4.171) shows that the 
members of the formula 


(4.181) J sin? x dx < S2 on vtec 


are equal. We can, however, think of u as being sin x and read the left 
member of the formula 
| 


8 
(4.182) J sin? x cos x dx = = f +e 


in the form “‘integral of u to the nth power dee u dee x dee x”? and then 
apply (4.171) to obtain the right member. 
It is not correct to claim that the members of the formula 


(4.183) J (Sx + 7)? dx s CEE O 4 


are equal. We can, however, let J denote the left member, observe that 
the integrand has the form u”, where du/dx = 5, and write 


(4.184) 1-3 | Ge+7 (5) dx = = EM be, 
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Thorough understanding of this particular example is of utmost impor- 
tance because it involves an idea that is very often used to overcome a 
difficulty. In (4.183) we have an integral of the form fu” dx which does 
not have the form fu” (du/dx) dx. However, du/dx is 5, a constant, 
sO we can insert the factor 5 in the integrand and compensate for the deed 
by inserting the factor 4 before the integral. 

To obtain the formula 


(4.185) | i = 2x dx = log (1 + x?) +c, 


J 


we read the left side “integral of one over u dee u dee x dee x” and apply 
(4 175). If the factor 2 had been missing from the integrand in (4.185), 
it would have been necessary to insert the factor and compensate for the 


deed. Thus 


ase) |pžai -= TES „2e de = L log (1 + æ?) + c. 


Our very modest table of integrals beginning with (4.171) does not 
reveal the answer to the question whether there are any functions F(x) 
for which the formulas 


1 1 
(4.187) F'(x) = are peste =F(x) +c 


are valid. Many useful purposes are served by this table and the more 
extensive one appearing opposite the back cover of this book, but one 
who has solved several of the problems at the end of this section is ready 
to recognize the fact that there exist much more elaborate tables of 
integrals. The books of Buringtont and Dwightf are exceptionally use- 
ful examples of books that give hundreds of integration formulas, tables 
of values‘of functions, and other mathematical information. It is possi- 
ble to proceed through our course without using tables other than those 
on the back cover and facing page of this textbook. However, students 
who contemplate following educational programs in which mathematics 
appears are well advised to purchase one of these books (or perhaps 
another more or less similar one recommended by teachers) and to spend 
occasional moments (and sometimes hours) inspecting its organization 
and studying its contents. Ability to understand and use the tables is 
not inherited but can develop rapidly as more calculus is learned. 
Experience shows that persons who have completed courses in calculus 


t R. S. Burington, “Handbook of Mathematical Tables and Formulas,” 3d ed., McGraw- 
Hill Book Company, Inc., New York, 1948, 296 pages. 

ł Herbert Bristol Dwight, “Tables of Integrals and Other Mathematical Data,” 4th ed., 
The Macmillan Company, New York, 1961, 336 pages. 
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refer to tables in books of tables in preference to tables in calculus text- 
books. Teachers can be particularly helpful when they require that 
their students purchase identical books of tables and make frequent 
comments about use of the tables. Sometimes use of a book of tables is 
permitted in tests and examinations where use of a calculus textbook is 


forbidden. 


Problems 4.19 


1 Tell the meaning of ff(x) dx. Be prepared to give full information at 
any time. 
2 Show that, when x is properly restricted, 


453 


(a) [atott demat y ye te 


o {= deat ete 9 
TOPTEN 
() [ x —2) dx = 5 - Fhe 


3 Is the formula 
fx? dx = xfx? dx (?) 
true or false? 
4 Brevity is sometimes but not always a virtue. It can be claimed that the 
second formula in (4.171) would be much more easily understood and used if it 
were written in the form 


f [u (x)| u (x) dx = KO 


Think about this, and then write the other four formulas in terms of the Newton 
notation for derivatives. Note that the last formula takes the form 


J oe dx = log |u(x)| +c. 


Remark: We can abbreviate (4.171) to the form fu” du = u”t!/(n + 1) + c, but 
for present purposes we can hold the view that further abbreviation of (4.171) is 
a step in the wrong direction. We need not be in a hurry to join the ranks of 
gullible people who think that the du appearing in the symbol fu du is a number 
because a correct result is obtained by pretending that du is “the” differential 
u' (x) dx and writing 


fu du = fu(x)u'(x) dx = s[u(x)]? + c. 


When we do not use the abbreviation, we do not need to worry about it. 
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5 Look at the integral 
SQ. + 5x)% dx 


and tell what must be done to enable us to evaluate the integral by use of the 
formula involving u”. 
6 Evaluate the integrals 


(a) if (1 — x)? dx (b) | sin 2x dx (c) f cos 3x dx 
(@) | a- 2de (e) feds a f zr dx 
Ans.: 

(a) —(1 — x)t/4 +e (b) —ż cos 2x + c 

(c) 4 sin 3x +e (d (1 — 2x)? +c 

(e) ġe +c (f) zlog [2% + 3| +e 


7 Sometimes we can make small alterations in the way integrands are written 
to put the integrands into forms where basic formulas are easily applied. Pay 
careful attention to the examples 


[tanade= [ ax = -f l (—sin x) dx = — log |cos x| + ¢ 


cos x% COS x 
| azè = | gi d = boo 
x log x” ~ J logan ~ °°8 og x| + c 
frer dx = +/ e?(2x) dx = ġe? + c. 
Then evaluate 


la) Javi F eax (b) [PE ae. 


8 While the terminology plays a minor role in elementary calculus, we can 
start learning that the equation 


dy _ 
(1) I 2x 


is an example of an equation that should be called a derivative equation but is 
called a diferential equation. Functions y for which (1) holds are called solutions 
of (1), and we know that (1) has many solutions. The particular solution of 
(1) satisfying the boundary condition 


(2) y = 16 when x = 3 
is found in a straightforward way. If (1) holds, then 
(3) y = f2x dx =x? +c, 


where ¢ is a constant that can be 5 or —3 or 416 but cannot be all of these things 
at once. The function in (3) will satisfy (2) if 16 = 9 + c and hence if c = 7. 
Thus the answer is y = x2 + 7. With clues to methods being provided by this 
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example, find the solutions of the following differential equations satisfying the 
given boundary conditions. 


(a) = =0,y = 1 when x = 0 Ans.iy =1 


\ 


(6) Tx 1,y = 2 whens = 3 Ans.iy=x—l1 


(c) — = cos 2x, y = 0 when x =0 Ans.: y = sin 2x 
(d) z = 5y = l whenx = 0 Ans.: y = ġe + 3 


9 A body moves to and fro on a line in such a way that its scalar velocity 
gat time ż is given by the formula 


o = 22? — 8¢ + 15. 


During what interval of time is the scalar velocity negative, and how far does the 
body move during that time? Hint: If s is the coordinate of the body at time ż, 
then ds/dt = v and hence 


3 
saz -+l 


where ¢ is a constant that is 0 if we choose the origin such that s = 0 when? = 0. 
Ans.: $ units. 
10 Ify is a function of x satisfying the differential equation 


(1) T = PY ~ 


and if we know that y > 0, then we can divide by y to obtain the first and then 
the rest of the formulas 


(2) -— = k, log y = kx + c, = gte y = etec, 


where ¢ is a constant that depends upon the particular function y with which 
we started. But ¢¢ is a constant that we can call 4, so 


(3) y = Ae. 


If we know that y satisfies the boundary condition y = yo when x = 0, then we 
can put x = 0 in (3) to find that 4 = yo and hence 


(4) y = yo. 


Remark: Without assuming that y = 0, we can solve (1) with the aid of a trick. 
Transposing a term in (1) and multiplying by ¢~** give the first and hence the 
second of the formulas 


d 
e— ke (2 — ky) = Q, Tn ety = Q. 
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This gives the first and hence the second of the formulas 
ety = A, y = Ae. 


More complete treatments of these matters are given in textbooks on differential 
equations. 

11 After having digested the preceding problem (which required that some 
ideas and methods be absorbed), find the solutions of the following differential 
equations that satisfy the given boundary conditions 


(2) 2 =y,y=1whens=0 Ans.: y = ¢ 

(b) a = 2y, y = 3 when x = 4 Ans.: y = 3ere-* 
dy 

(c) Ja = 9 Y = 0 when x = 0 Ans.: y =0 


12 Fill in the right members of the three formulas 


ax, dx, x=? 


d dt 


when 
dx 1 42 

(a) T =g Ans.: g, gt + ci zg + cettc 

(b) x = bg? Het + ce Ans.: Same as (a) 

(c) 5 = sin? Ans.: cos t, sin t, —cost+¢ 
dx ° l1 >» 

(d) FA = cos 2ż Ans.: —2 sin 22, cos 22, ș sin 2t +c 
dx ə% 1 

(e) 7; =e Ans.: 2e%, et, ge” +c 


13 A particle P moves in the xy plane in such a way that its acceleration a 
(a vector) is always —gj. Thus 
a = 0i — gj. 


Show that its velocity and displacement vectors must be 


v= ci + (—gt + k)j 
r = (ct + c)i + (—Set? + kit + bo)j, 


where the c’s and #’s are constant. Find the equation of the path in rectangular 
coordinates when cı 0 and again when cı = 0. Hint: To solve the last part, 
put r = xi + yj so that 


= cıt + Co, y = — set? + hit + ko 
and eliminate 1. 


14 This problem requires us to think about indefinite integrals and gives our 
first glimpse of the famous and important formula for integration by parts. 
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Let x be confined to an interval J over which two given functions u and v are 
differentiable. The standard formula 


(1) A u(x)o(x) = u(x)o' (x) + o(x)u' (x) 
then (why?) gives the formula 
(2) Slulæ)o (x) + olx)u’ (x)] dx = u(x)o(x) + c. 


In case the separate integrals are cooperative enough to exist, we can (why?) 
put (2) in the form 


(3) Su(x)o’(%) dx + fo(x)u'(x) dx = u(x)o(x) + c 
and transpose to obtain the formula 
(4) Julx)o' (x) dx = u(x)o(x) — fo(x)u’(x) dx, 


which is known as the formula for integration by parts. For the particular case 
in which u(x) = x and o(x) = e*, show that (4) reduces to 


(5) fxe? dx = xe? — e + c. 


Finally, check (5) by showing that the derivative of the right side actually is the 
integrand. 

15 Read and work the preceding problem again. 

16 With Problem 14 out of sight, start with the formula for the derivative 
of a product and construct the formula for integration by parts and give an 
application of it. 

17 Start with the function fo for which fo(x) = 1 when —1 <x < l and 
determine the natures of the functions fi, fe, Ja © © + for which the formulas 


fala) = faal), fala) = Sfax) dx, (n = 1,2,3, + > >) 
are valid. Ans.: There exist constants ¢1, C2, ¢3, * * * such that 


filx) = x + C1 

Jx) = 5x? + 61x + Ce 

fala) = ex? + Hex? + cox + cs 

falx) = dw! + Few? + Geox? + ce + ca 


etcetera. Remark: These things will appear later. 
18 Prove that the first of the formulas 


f sgn x dx = |x| + ci, f sgn x dx = |x| + c2 


is correct when x > O and the second is correct when x < 0. Prove that there 
is no constant ¢ such that the formula 


f sgn x dx = |x| + c 
is correct when x = Q. 
19 This problem contains hundreds of parts, and there is much to be said for 
spending several hours or days solving a considerable number of them. To 
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solve one part, pick an integral formula from a (preferably your) book of tables 
that has the form 


Sf(*) dx = F(x) +e 


(except that most tables omit the constants) and show that F’(x) = f(x). This 
promotes an understanding of integral formulas and provides practice in differ. 
entiation. It is never too early to start acquaintance with formulas involving 
X where X is one or another of a + bx, a? + x?, a? — x7, ax? + bx + c, etcetera. 
In these situations, modesty and timidity are not virtues. We profit most when 
we attack the problems that seem most impenetrable and discover that they 
really are very simple. 


4.2 Riemann sums and integrals This section introduces the sums 
and integrals that are named after Riemann (1826-1866) in spite of the 
fact that Archimedes (287-212 B.c.) knew how special ones could be 
used in a few special cases. Let f be a function which is defined over an 
interval a S x < b and has values f(x) such that 


(4.21) m < f(x) S M TERET 


where m and M are constants. This amounts to saying that f is bounded 
over the interval; M is an upper bound and m is a lower bound. Our next 
few steps are so simple that it may be difficult to see why they are impor- 
tant. As in Figure 4.212, let x be a fixed (or selected) number for which 
a<<«*s5b. Thus x can be b, but it is not necessarily so. Let n bea 
positive integer. We make a partition P of the interval from a to x 
into n subintervals by inserting points to, ti, t2, © © * , tai, tn, where 


(4.211) a =t Lt Kt L +: Ltn <ir =x. 


These points are the circled points of the figure and are the end points 
of the subintervals. 


a it to t? t*a t* x b 
to tı to t3 tn-2 tn- tn t 


Figure 4.212 


Let Aż, denote the length of the first subinterval so that At; 


let Aż: denote the length of the second subinterval so that At. = 
and so on so that 


ti — to, 
to — hi, 


(4.213) At, = th — thy Iskan). 


It is not required that the points to, ti, - © - , tn be equally spaced. The 
greatest of the numbers At), Ato, © + + , At, is called the norm of the parti- 
tion P and is denoted by the symbol |P|. Thus |P| is the length of the 
longest of the subintervals in P. Our next act introduces the star char- 
acters. Let tř (read tee one star) be a number (or point) in the first 


4.2 Riemann sums and integrals 213 


subinterval so that to S tř < 4, let t* be in the second subinterval so 
that t; S t* S te, and so on so that 


(4.214) tS te Sh Ilsken). 


Our machinery, which is still very much simpler than that in an elec- 
trically operated dishwasher, enables us to produce numbers that are 
called Riemann sums. We multiply f(*), the value of f at t¥, by At, 
the length of the interval containing tř, and add the results. Thus, denot- 
ing the Riemann sum by the symbol RS, we have 


(4.22) RS = f(ef) Ati + f(eF) Ate + fH) Ats + + + + + f(t) Atn. 


Because it takes too long to write this, we abbreviate it to the form 
(4.221) RS = Y (tt) At. 
k=l 


The right side is read “sigma & running from 1 to n eff of tee kay star 
delta tee kay” and it denotes the sum of the terms obtained by giving & 
the values 1, 2, 3, --+,mn. The 2 (sigma) is called the summation 
symbol, and it is very convenient. 

Everybody should see that, when the function f and the numbers a 
and x are given, it is easy to select the partition P in very many different 
ways and to select the points #7 in very many different ways. When an 
electronic computer is kind enough to do the arithmetical chores, it is 
even easy to produce very many Riemann sums. 

Experience shows that we should avoid future difficulties by allowing 
the partitions and Riemann sums to slumber peacefully while we invest 
a moment to think about the names which we have attached to the parti- 
tion points and the intermediate star points that determine them. The 
points in Figure 4.212 were called to, t1, - © © , ta and tf, tf, © °° t. 
We could, without changing the value of the Riemann sum, have called 
these same points No, Ai, © © * , An and AF, AY, - - + , AX. Thus there 
is a sense in which the names of these points are “dummy names”; we 
could have called the points #’s or w’s or vs or Xs or ws or ¢’s. When 
this matter is understood, we must ask and answer two questions. First, 
why did we avoid the “natural” names xo, xı, °° © , &nand xf, xš, ° °° , 
xx? The answer is that we already have the interval from a to x on an 
x axis appearing in our work, and we will have too many x’s around the 
house if we allow any more toenter. Secondly, why did we use the names 
to, 1, °° + , ty and t*, tž, - ++, ¢*? The only answer we can give is 
that they are as good as any and better than most alternatives. In 
situations where we can conveniently use the “natural” names xo, *1, 

° Xn and x*, xž, + + + ,x*, we usually do so. Finally, we do not use 
the letter ¢ to denote “dummy integers” in (4.214) because the habit 
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of using 7 leads to awkwardness when we finish study of calculus and enter 
realms where í is always the imaginary unit whose square is —1. We 
use & because it is as good as any and better than most. 

We now come to the most fundamental remark that appears in the 
theory of Riemann integration; analogous remarks appear in theories 
of other integrals. Depending upon the function f and the numbers 
a and x that have been selected, it may be true (or it may be false) that 
there is a number J such that to each positive number e there corresponds 
a positive number 6 such that 


(4.23) | 5 fi) An I| <e 
k=1 


whenever |P| < 6. This is, of course, just a precise way of saying that 
there may be a number J such that each Riemann sum with a small norm 
is near Z. If this J exists, then f is said to be Riemann integrable over the 
interval from a to x and J is said to be the Riemann integral of f over the 
interval. This integral is denoted by the symbol in the formula 


(4.24) I= IP fli) di 


and the symbol is read “‘the integral from a to x of eff of tee dee tee.” 
The numbers a and x are called the lower limit and the upper limit of 
integration, and we always read the lower one first. The symbol ¢ is 
called a dummy variable of integration, the derogatory terminology being 
applied because the value of the integral would be the same if ż were 
replaced by s or u or a or 6 or any other symbol that cannot be confused 
with a, x, f, and d. It is a convenience (and sometimes also a source of 


misunderstanding, confusion, and controversy) to drag in the notation 
of limits and write ' 


(4.25) lim È, flit) A = f: FA) dt. 


|P —0 k= 


A much more substantial convenience results from boiling this down to 


(4.251) lim ) f@) At = [* FO) di, 


the idea being that we can restore the omitted embellishments whenever 
there is a reason for doing so. 


In case no such number J exists, we say that f is not Riemann integrable 
over the interval from a to x and that I, f(t) dt does not exist (that 1s, 
does not exist as a Riemann integral). To emphasize the fact that a 
bounded function f and an interval a S$ x S b can be such that fa f(t) dt 
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does not exist, we look briefly at an example. | Let f be the dizzy dancer 
function D, defined over the interval 0 £ x < 1, for which 


D(x) = 0 (x irrational) 
(4.252) D(x) = 1 (æ rational). 


It is clear that, whatever the partition P of the interval 0 S x < 1 may 
be, the Riemann sum 


(4.253) 2 D(t#) At, 

=1 
has the value 0 if the numbers tř, #¥, - - + , ¢* are all irrational and has 
the value 1 if the numbers tř, tř, - - - , ¢* are all rational. It follows 


from this that there is no number J such that to each positive number e 
there corresponds a positive number 6 such that (4.23) holds whenever 


IP] < 6. This shows that the symbol h D(t) dt has no meaning or that 
D(t) dt does not exist. 


If we suppose, as above, that f is a function which is bounded over an 
interval, then the following theorem shows that the answer to the ques- 
tion whether f is integrable over the interval depends only upon the set 
of discontinuities of f. 

Theorem 4.26 A function f is Riemann integrable over an interval if 
and only if it is bounded and the set of discontinuities of f which lie in the 
interval has Lebesgue measure zero. 

This theorem is proved in modern textbooks that fully earn the right 
to be called textbooks on advanced calculus. The proof is, from our 
present point of view, both long and difficult, and we do not need to 
know anything about it. Moreover, we do not need to understand the 
theorem, but we should not be injured by taking a hasty look at Figure 
4.261 and making a modest attempt to understand one of the definitions 


a hood, Íz I; I, k b 
ma aaa aama aŘĖĖA 
Figure 4.261 


which has fundamental importance in more advanced mathematics. 
A set D of points on a line is said to have Lebesgue measure 0 if to each 
e > 0 there corresponds a collection J;, Ie, Is, © + + of intervals such that 
each point of D lies in at least one of these intervals and, for each n = 1, 
2,3, -- + , the sum of the lengths of the first n intervals is less than e. 
Sometimes it is very easy to show that a given set D has Lebesgue measure 
0 by showing that if e > 0, then there exist intervals J, I2, J3 - + > such 
that each point of D is in at least one of these intervals and, moreover, 
the length of J; is less than e/2, the length of J» is less than €/2?, the length 
of I; is less than ¢/2%, etcetera. The collection of intervals may be a 
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finite collection, that is, it may contain only 1 or 2 or 3 or 416 or 31,690 
or some other positive integer number of intervals. The collection of 
intervals may be a “countably infinite collection,” that is, it may contain 
a first, a second, a third, etcetera, so that to each positive integer 
there corresponds an interval Ją. In each of these two cases, the collec- 
tion of intervals is said to be a “‘countable collection.” Only a most 
rudimentary understanding of these matters enables us to reach the con- 
clusion that if D contains only 0 or 1 or 2 or 3 or 416 or any other finite 
number of points, then D must have Lebesgue measure 0. In any case, 
we should have at least a hazy understanding of the fact that Lebesgue 
(1875-1941) was a great French mathematician and that Theorem 4.26 
implies the much simpler following theorem which we are required to 
know in this course. 

Theorem 4.27 If f is bounded over the interval a S x S b and tf f ts 
continuous over the interval (or is discontinuous but has only a finite set of 
discontinuities in the interval), then the Riemann integral in 


(4.271) JEKA at = lim X SGE) Ata 


exists whena <x Sb. 
As we near the end of the text of our introductory section on Riemann 
sums and integrals, we pause to think about our present state and future 


development. We have a new symbol, namely, f: f(t) dt. If a<b, 


ifa < x S b, and if fis defined over the intervala S x < b, then (depend- 
ing upon a, x, and f) the symbol may be meaningless or it may be a 
number. Answers to questions depend upon partitions and Riemann 
sums. Partitions are so simple that our little sister can understand them 
completely and be puzzled only by our great interest in them. Riemann 
sums Ef(tž) At, are less simple, but we can construct them in great pro- 
fusion. Matters grow substantially more complex when we ask whether 


e x . e 
there is a number Í, f(t) dt such that to each positive number e there 


corresponds a positive number ô such that 


(4.28) Sf) An — [7 1@ a| <« 


k=1 


We should all recognize this and admit that full comprehensions of 
machinery and its applications is not quickly attained. In fact a sub- 
stantial part of this textbook is devoted to promotion of understanding 


of Riemann sums and their applications. We shall have plenty of oppor- 
tunities to learn. 


whenever P is a partition of the interval a S t £ x for which |P| < 6. 
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So far, the integral in (4.271) has been defined only when x > a. We 
now complete the definition by setting 


(4.281) [PrOoa=0, [EIO d= - [pm a, 


the second formula being valid when x < a and f is integrable over the 
interval from x to a. 


Problems 4.29 


1 Practice the art of telling how the number 


in F(t) dt 


is defined. Be prepared to give the full details, including Riemann sums, at 
any time. 

2 Tell whether you think it wise to abbreviate the statement “To each 
positive number e€ there corresponds a positive number 6 such that 


| 5 f(te) At, — [Pro dt | < 


whenever the sum is a Riemann sum formed for a partition P of the interval 
a Št x having norm less than 6” to the statement 


lim ) FGF) Ay = in ft) dt. 


[PIO p21 


Remark: If you do not have an opinion, think about the matter and get one. 
3 For better or for worse, the “formula” 


lim » FUF) Ay, = f: FO dt 


is considered to be an assertion. Tell precisely what it means. 
4 Tell whether you would like to learn and use a completely new notation by 
which the “formula” 


n 


approx ), flit) An = [°F d 


elP|<ô k=1 


is used to abbreviate the statement that to each positive number e there corre- 
sponds a positive number 6 such that 


) f(t) Atk — [Pro d | <e 
k=1 


whenever P is a partition of the interval a S t £ x for which [P| < 6. Remark: 
If you do not have an opinion, think about the matter and get one. 

5 We often hear about the great scientific progress of our modern era, 
and we should think about anexample. One of the great strides forward is made 
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by abandoning the good old idea that the elementary functions (polynomials, 
trigonometric functions, etcetera) are always the simplest and most useful func. 
tions. There are very many situations in which step functions are the simplest 


and most useful functions. Our first problem is to follow instructions to prove 
that 


(1) Í, °7 dt = 21. 
Draw a figure showing a partition P of the interval 2 S x S 5 into subintervals 
having lengths At), Ate, - + © , At, and observe that 
n 
(2) > An = 3. 
k=1 


Observe that the integral in (1) involves the function f for which f(x) is always 7. 
Show that, with the notation of the text, 


(3) RS = >, fU*) An = >, 7An 
k=1 k=1 
=7 >, Ay =7:3 = 21 
k=1 


Since each Riemann sum is 21, it is quite apparent that RS is near 21 whenever 
the norm of P is near 0. This proves the formula (1). 
6 Supposing that a < b and k is a constant, prove that 


(1) f bat = BO - a) 


Remark: We are (or soon will be) authorities on areas of rectangular regions. 
We can observe that if k > 0, then the right side of (1) is the area of a rectangu- 
lar region having base length (b — a) and height k and hence is the area of the 
region of Figure 4.291 which is bounded by the graphs of the equations x = a, 


Figure 4.291 Figure 4.292 


x = b, y = 0, and y = k. Incase k < 0, we can put (1) in the form 
b 
(2) J. Bat = —(-HO - a), 


where —k > 0, and observe that the right side is the negative of the area of the 
region in Figure 4.292 bounded by the graphs of the equations x = a, x = b, 
y = 0, andy =k. We must always know that areas of rectangles are positive 
The idea that rectangles below the x axis have negative areas is as absurd as the 


idea that cities south of the equator have negative populations. 
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7 This problem requires us to attain a complete understanding of a more 
complex situation. Let 


f(x) = 3 (2<x* <4) 

f(x) =4 (4<% <5) 
and let (2), f(4), and f(5) be defined in any way that pleases (or displeases) the 
fancy. Then Theorem 4.27 implies existence of the integral J in 


I= h FOEN 


We want to find J, that is, to find the numerical value of J. Draw a figure show- 
ing the interval 2 S x S 5 and, in addition, the graph of f. Make a partition P of 
the interval 2 S x < 5 in which 4 is one of the partition points. Choose the 
points tz in such a way that they are not at the ends of the intervals in which they 
lie. Show that the terms in the Riemann sum RS can be split into two sums 
RS; and RS» in such a way that RS; contains those terms for which 2 < t7 < 4 
and RS» contains those terms for which 4 < tf < 5. Show that RS, = 6 and 
RS, = 4 and hence that RS = 10. Our next step is to realize what we are 
trying todo. We are not trying to prove that f is integrable and are not required 
to prove that [RS — Z| is small whenever the norm of P is small. We are trying 
to find J, and we can use the known fact that [RS — J| must be near 0 whenever 
the norm of P is near 0. Therefore [RS — Z| must be near 0 whenever P is a 
partition of the type constructed above and the norm of P is near 0. But RS = 
10 for each partition of the type constructed above, and it follows that J = 10. 
Notice that we have, in the course of our work, proved that 


h fd) dt = J, EO dt + I f(t) db. 


Interpret the numerical results in terms of areas of rectangular regions. 

8 Supposing that xı < xz < x3 and that kı and kz are constants, draw a 
figure showing the interval xı £ x S x, and a graph of the function f for which 
F) = f(xe) = f(x) = 0 and 

f(x) = kı (x1 <x< x2) 


F(x) = ke (xo < x < x3). 
Show that 


M f(t) di = Jz FA dt + Ie ft) dt 
= ky(x2 — x1) + k(x — x2). 


Tell how the result can be interpreted in terms of areas of rectangular regions 
when (a) kı and ke are both positive, (b) kı > O and ke < 0, and (c) kı and ke 
are both negative. Explain the manner in which these results can be extended 
to step functions that have constant values over 3 or 300 intervals instead of just 2. 

9 Tell why each of the following Riemann integrals exists or fails to exist. 


1 ly+1 
(a) Í. (2 + 3t + 442) di © Tad 
1 1 z] 
(c) [ope (d) i 7 at 


Hint: Use theorems given in the text. 
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10 In a campaign to obtain good ideas about Riemann sums and integrals, 
we can use the discontinuous function ¢, defined over the interval O $ x < 1, 


for which 


1 1 1 1 1 
d(x) = 0 when x = 1, 5, 3, F, 5, & |. 


(1) 


d(x) = 1 when x = 1 and m is a positive integer. 
m 


Sketch a figure which shows the nature of the graph of ġ. ‘Then mark the end 
points xo, %1, °° * , Xnand the intermediate points xt x3, °° he ofa partition 
P of the interval 0 S x S$ 1 for which [P| < 0.1; to mark the end points of a 
partition for which |P| < 0.0001 would be a tedious operation requiring sharper 
pencils and better microscopes than we normally carry around. ‘Try to see 
reasons why the Riemann sum 


n 
(2) È (i) Am 
must be near 0 whenever |P| is near 0 and hence that 
(3) fy 9G) dx = 0. 


Then start cultivating the art of understanding and originating thoughts about 
Riemann sums more or less like the following. Let ebe a given positive number 
for which O0 < e< 1. Let k = €/10 and suppose at first that |P| Sh. The 
terms of the Riemann sum (2) are all nonnegative. Those terms for which xz 
can be a point of the interval OSxsh contribute at most 2h to the sum. 
Those terms for which x% > A will be O unless xg = 1/m, where m is an integer 
for which 1/m > horm < 1/h. Thus there are less than 1/h nonzero terms for 
which xf > h and (xg) #0. Since each one of these terms can contribute at 
most |P| to the sum of these terms, it follows that the sum of all of these terms 
cannot exceed (1/h)|P|. Therefore, 


(4) 0s $ plaž) Axr S 2h + - = |P < 0.2 + El. 


If we let 6 = 2e?/25, we will have 


n 
(5) 0s ) ob) an < e 
whenever |P| < ô. This implies the first and hence the second of the formulas 


(6) lim y b(x*) Aap = 0, IRO dx = 0. 


I1 Supposing that g is the corn-popper function of Problem 16 of Problems 
3.49, determine the value (if any) of h g(x) dx. 
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12 Our purpose is to discover that some very obvious and superficially useless 
remarks about Riemann sums lead to the very useful conclusion that the formula 


(1) [Pio a= fre out ap du 


is correct whenever p and g are constants for which p # 0 and the integral on the 


left exists. Let us begin by looking at (1). If we suppose that two variables ż 
and u are related by the formulas 


— _t—q di 
(2) t=pu+g, u= J = Ps 

we can put? = pu + g and dt = (dt/du) du in everything after the integral sign 
in the left side of (1) to obtain everything after the integral sign in the right side 
of (1). We can observe that the lower limit of integration on the right side is 
the value attained by u when ż is the lower limit of integration on the left side. 
Similarly, the upper limit of integration on the right side is the value attained 
by u when ż is the upper limit of integration on the left side. Of course, we are 
entitled to take a dim view of these manipulations until we discover how simple 
and useful they are. Meanwhile, we forget about (1) and start working with 
some Riemann sums. Let P be, as in Figure 4.293, a partition of the interval 


asxtSx having partition points žo #1, ° °°, tn and intermediate points 
* . 
it, t2, °° , ta. Supposing that ¢ and u are related by the formulas (2) and 
that p > 0, we set 
* 
tk — tg — 
(3) m=, =E (b= 1,2,-- pn). 
p p 
To simplify writing, we set 
ty — a tn — x— 
(4) A=- q 1, X = #— e n G 
p b p b 
The numbers wo, ui, °° * , Un and ur, us, ux then form the partition 
a ti t* tt tr x 
(2) (e) o (2) (2) o) (o) 
to t; to tpey É; tn-1 Én 


Figure 4.293 


Figure 4.294 


points and the intermediate points of partition Q of the interval 4 S u S X 
shown in Figure 4.294. Moreover, when 


(5) At, = th — tk—l, Au, = Uk — Uk- 
we find that 


(6) At, = (pur + g) — (pur-ı + 4) = p Aur 
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and 
(7) È fh) An = 2, flouk + 0) Aw 


The result (1) follows from this. Let J denote the left member of (1). Let 
e > 0. There is then a 6 > 0 such that 


(8) |1- È fet) Am < e (PI <8) 


Since (6) implies that |P| = p|Q|, we see from (8) and (7) that 


(9) |z- p puž + q)p Aur | < e (UQI < 4/9). 


But the sum in (9) is a Riemann sum formed for the partition Q of the interval 
A S u S X and the function F having values 


(10) F(u) = f(pu + 9) 


when 4 S u <S X. It therefore follows from the definition of Riemann integrals 
that 


(11) Jè F(u) du = I. 


But (4) and (10) show that the left member of (11) is the right member of (1). 
This proves our conclusion (1) for the case in which p > 0. In case p <0, 
some details must be modified because Figure 4.294 must be turned end-for-end, 
but the result is still correct. The formula (1) which we have proved is called 
the formula for linear changes of variables in Riemann integrals. 

13 By use of formula (1) of Problem 12 show that 


b- 
(a) [e-ora = f ° u? du Hint: Putt — c =u. 
a a— c 
(b) ie sin 2t dt = $ k sin u du 
h 1 1 h/a 1 ; 
(c) J, mee = 3 Í ivf di Hint: Put x = at. 
(a) h 1 J h/a 1 J 
—______. = — dt 
J, Vac” o V1l—#? 
(6) f? sine de = [0~" sin (+A) dx 


Hint: Before you start, replace one of the variables of integration by a different 
variable of integration. 


A [Pie de = [I He + h) ae. 


Remark: This last formula shows that we can add a constant to the variable of 


integration if we subtract it from the limits of integration. This information 
is Sometimes very useful. 
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14 Assuming that the integrals exist, show that 


(1) fE IO dx = f}? ia) ae. 


Remark: This innocent formula and a result of the next section enable us to pro- 
duce the better formula 


(2) J * f) dx = J ’ I dx + Í H f(x) dx 
= h [A(—x) + f(x)] dx. 


h 
This gives the very useful fact that f 1 dx = 0 when f is an odd function, 
that is, f(—*x) = —f(x), and that 


f| 1 dx = 2 [, t dx 


when f is an even function, that is, f(—x) = f(x). 

15 Remark: This remark is designed to indicate that mathematics is a lively 
subject in which even good ideas can be modified in various ways, and that there 
are integrals of many different types. We can be irked by the fact that the Rie- 
mann integral 


h F(x) dx 


does not exist when f is the function for which f(x) = 1 when 0 <x < 1 and 
f(x) =2 when 1 <x <2. The difficulty is that f(1) is undefined and that 
Df(te) At, is undefined when te = 1 for some & If, however, we extend the 
definition of f by setting f(1) = 75, then the new extended function is Riemann 
integrable over the interval 0 Sx <2. We cannot reasonably undertake to 
remove this irksome situation by changing the definition of the Riemann integral, 
because changing basic definitions destroys our means for communication of 
information. We can, however, introduce new types of integrals. We can, for 
example, use the letter F to make us think of a finite set and produce the following 
definition. A function f is Riemann-F integrable over a S x < b if there is a 
finite set F such that f is defined at all points of the interval a S x S b except 
at the points of F and, moreover, there is a number J such that to each e > 0 
there corresponds a number 6 > 0 such that |RS — I| < e whenever RS is a 
Riemann sum for which |P| < 6 and the points t% are all different from points 
of F. This definition does not require f to be defined everywhere overa Sx < b 
and it removes the irritation. Still another definition can be constructed by 
making similar use of the letter C to make us think of a countable set of points, 
this being either a finite set or a set whose elements can be placed in one-to-one 
correspondence with the set 1, 2, 3, - - + of positive integers. A more sophisti- 
cated definition makes use of the letter N to make us think of a null set, this being 
a set having Lebesgue measure 0. As has been remarked, there are many kinds 
of integrals. Mathematicians who use integrals without knowing which ones 
they are using are comparable to chemists who use chemicals without knowing 
which ones they are using. 
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4.3 Properties of integrals In what follows, all integrals bearing 
limits of integration are Riemann integrals. ‘They are limits of Riemann 
sums, and it could be expected that, except for cases in which the inte- 
grands are step functions, it must be impossible to obtain their exact 
values and it must be difficult to obtain reasonably good approximations 
tothem. It turns out, however, that there is a calculus, an invention of 
Newton and Leibniz, by which exact values of very many of the most 
important integrals can be calculated very quickly. Dictionaries tell us 
that a calculus is “a method of computation.’ ‘The particular calculus 
that appears at the end of this section was found to be so overwhelmingly 
important that it came to be known as “the calculus.” This calculus 
enables us, for example, to evaluate the integral in the formula 


(4.31) [ea =F] -g-ga 


by writing nothing more than this. Meanings of words have evolved in 
such a way that we now consider “calculus” or “the calculus” to be a 
name assigned to a part of mathematics involving derivatives and 
integrals. t 

For making calculations involving integrals, we often need the results 
set forth in the following theorems. Proofs of these theorems may be 
omitted; these theorems are rather simple consequences of Theorem 4.26 
and properties of Riemann sums and their limits. 

Theorem 4.32 Iff is integrable over an interval containing a, b, and c, 
then 


EO dt + IEO dt = EO dt. 


Theorem 4.33 Iff and g are integrable over a S x S b and A and B 
are constants, then 


fe [Af + Be] dt = A J fi) di +B JZ g(t) dt 


whenever xı and x lie in the intervala S x S b. 
Theorem 4.34 Ifa < ), if fi, fa, fo are integrable overa S x S b, andif 


filx) S falx) S f:l) (asx sb) 


then 
[no dt S fJ RO dt S f RO dt. 


t Historians who claim that Archimedes knew calculus do not always point out that the 
knowledge was attained posthumously when the meaning of “calculus” changed. Com- 
plete misunderstanding of this matter can serve as a basis for the absurd contention that 
Newton and Leibniz merely rediscovered inventions of Archimedes. 
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Theorem 4.341 If kis a constant, then 


[ee dt = k(x, — x1). 
Theorem 4.342 Ifa < b, if f ts integrable over a S x <S b, and if 


ms f(x) <S M (asx sb) 
then 
m(b — a) £ f f(t) dt < M(b — a) 


and 


Š 
IA 
IA 


1 b 
bos Í f dt = M. 

Theorem 4.343 If f 15 integrable over a S x S b, then so also is the 
function having values |f(x)| and 


[2 #00) dx| s | [ZEON ae| 


whenever x, and x» lie between a and b. 

The next theorem is not so obvious, and it is so important that we shall 
discuss it and prove it. Much of the theory and many of the applications 
of the calculus involve relations between derivatives and integrals. 
Theorems which give information about derivatives of integrals or inte- 
grals of derivatives are called fundamental theorems of the calculus. ‘The 
following theorem is one of the best of these. It has very many applica- 
tions and shows, among other things, that if fis continuous overa S x <S 
b, then there exists a function F for which F” (x) = f(x) whena S x S b. 
In fact, it shows that if f is continuous, then the Riemann integral in 
(4.351) is an “indefinite integral” of f. 

Theorem 4.35 If f is integrable over a S x <S b, then the function F 
defined by 


(4.351) F(x) = k f@ di 


is continuous overa S x X b and 
(4.352) F' (x) = f(x) 


for each x for which f is continuous. 
To start the proof, we observe that if x and x + Ax both lie in the inter- 

val, then 

z+ Az 


(4.353) F(x + Ax) — F(x) = f 


a 


JO dt — [EIO dt = en f(t) dt. 


To prove continuity of F, we use Theorem 4.26 to see that f must be 
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bounded and hence that there is a constant positive M for which —M < 
f(x) S M or |f(x)| S M. Therefore, 


Fe + Ax) —F(@®)| s | f°" mu dt| = M Axl. 
The sandwich theorem then implies that 
(4.354) lim F(x + Ax) = F(x) 
Az—0 
and hence that F is continuous at x. It can be observed that we have 
proved more than was promised; the function F must have bounded 


difference quotients. To prove (4.352), let x be a point at which f is 
continuous. From the two formulas 


Ple t An) ~ FG) = + [10 dt, f(x) = ~ [16 at 


we obtain 

F(x + Ax) — F(x) _ _ 1 z+Az E 
asss PETAD FO fa i O - se a 
Let e > 0. Choose a number 6 > 0 such that 


[f(t) — f(x)| < «/2 (jt — x| < 8). 
Then when |Ax| < 5, we can use Theorems 4.343 and 4.341 to obtain 


Peta =F) _ hay] 3 | [to -ol a| 
s| [t fal - F< 
Therefore, 
(4.356) lim E + Ax) — Fx) _ fa) 
Ax—+0 Ax 


and (4.352) follows from the definition of F’(x). This completes the proof 
of Theorem 4.35. 

Supposing now that f is continuous over a S x < b, we proceed to show 
how Theorem 4.35 can be used to obtain the promised method for evalu- 
ating Riemann integrals. Putting x = ain (4.351) shows that F(a) = 0. 


Putting x = b in (4.351) and then changing the dummy variable of inte- 
gration from ż to x gives 


F(b) = f. flx) dx. 
Therefore, 


(4.36) f: flx) dx = F(b) — F(a). 
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When problems are being solved, it is always convenient to use the bracket 
symbol in the formula 


(4.361) F(x) |’ = FQ) — F(a). 


This symbol can be read “eff of x bracket a, b.” The symbol means 
exactly what the formula says it does; to obtain its value, we write the 
value of F(x) when x has the upper value b and subtract the value of F(x) 


when x has the lower value a. For example, «| = 27 — 8 = 19. Itis 
easy to see that the value of the bracket symbol is unchanged when we 
add a constant to the function appearing in it. Thus 
b 
F(x) + c|; = (FO) +4 — [F(a) + c] = F) — F(a). 
Therefore, we can put (4.36) in the form 
b 
(4.362) J I) de = F(x) +c 


where c is 0 or any other constant. Since we have assumed that f is 
continuous over a S x S b, it is a consequence of Theorem 4.35 that 
F'(x) = f(x) when a Sx <b. Since each function whose derivative 
with respect to x is f(x) must have the form F(x) + c, the result (4.362) 
can be put in the following form. 

Theorem 4.37 Iff is continuous over a S x Sb and if F' (x) = f(x) 
when asx <b, then 


J, f) dx = F(x) |? = FQ) — F(a). 


In substantially all applications of this theorem, the notation of indefi- 
nite integrals is used. In such cases the following version of Theorem 
4.37 gives precisely the information we actually use to evaluate integrals. 


Theorem 4.38 The formula 


(4.381) f KA ax = F(x) |’ = FQ) — F(a) 
is correct if f is continuous overa S x S band 
(4.382) Sf(x) dx = F(x) + c 


whenasxsb. 

When we are able to find a useful expression for the F(x) in (4.382), 
the integral in (4.381) can be evaluated with remarkable ease. We sim- 
ply ignore the limits of integration on the first integral until (4.382) has 
been obtained and then, taking c = 0 unless it seems desirable to give c 
some other value, insert the bracket symbol to obtain (4.381). For 
example, 


228 Integrals 


Problems 4.39 


1 Make a small table of integrals by copying formulas from the second 
column of (4.171) to (4.175). Combine the processes of learning these formulas 
and using them to show that 


1 1 
(a) I, (x — x?) dx = 4 (b) Í x(1 — x) dx = 4 
(c) h x?(1 — x°) dx = fz (d) h x?(1 — x)? dx = x5 
x] 2 
(e) f = dt = log x A |, pgi = blog 5 
(g) N sin x dx = 2 (h) IN cos x dx = 0 
1 2 l1\2 
. -z _ _ ol . t —2 
(i) fie dx =1l1—e (7) Í. (« +5) dx = 72 
1 fÈ i bh) — si 
(k) z, cos (a + bt) di = sin (a t bi) — sin e 
2 Verify the formula 
1 _ p! 
Í (1 — a) de = GaP FT 


for some pairs of small nonnegative integers p and g. Remark: Anyone who 
wishes to augment his corpus of scientific information should be informed that 
this is a famous and important formula. The integral is the beta integral. The 
formula is correct whenever p and g are real or complex numbers with nonnega- 
tive real parts. When Cauchy extensions of Riemann integrals have been defined 
and are used, it can be proved that the formula is correct when p and g are com- 
plex numbers with real parts exceeding — 1. 

3 While we are not now indulging in proofs of such things, the two integrals 


l 1 e] 1 1l 
0) hary™ htr” 


are nearly equal when sis near 1. Nevertheless, we must use different integration 
formulas to evaluate the integrals. Obey the rules and show that 


1 1 21 — 1 
(2) J. CE r (s = 1) 
1 1 
(3) J, Ip” = log 2. 


Remark: While the details need not be fully understood at the present time, we 


pause to learn that the right member of (2) really is near log 2 when s is near 1. 
This means that 


les 
(4) lim Z = log 2. 


To see that (4) is correct, we can let 


(5) f(x) = pt log2 
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and, after observing that x log 2 = log 27 and hence f(x) = 2%, put (4) in the 
form 


6 imta) — £0) 


sl l—-s 


= log 2. 


But it follows from the definition of derivatives that the left member of (6) is 
f'(0). From (5) we find that f’(0) = log 2. Therefore, (6) and (4) are correct. 
The conclusion to be drawn from this story is that the function F defined by 


1 4 
(1 Fe) =f, aay 


is a continuous function and, unless we can find another scapegoat, we must 
blame the well-known perversity of inanimate matter for the strange fact that 
F(s) is expressed in terms of exponentials when s Æ 1 and is expressed as a loga- 
rithm when s = 1. 

4 This problem, like very many of the fundamental problems of science, 
requires much more looking and thinking than calculating. Look at Figure 
4.391, which shows the graph of a step function f defined over the interval 
a Š x S b, and observe that f(x) 2 0. Remember that, in the problems at the 
end of Section 4.2, we discovered (or almost discovered) that 


+ 


(1) f. fe dx = |S), 


where |S] is the area of the set S of points (x,y) for which a S x S bandOSyS 
f(x). The next step is the most difficult one. We should realize that, at the 
present time, our ideas about areas of nonrectangular point sets are at best some- 
what vague and nebulous and are at worst nonexistent or even erroneous. The 
rest of our work is much easier. We look at Figure 4.392, which shows the graph 
of f over the interval a S x < b for the special case in which f(x) = x2, a = 0, 
and b = 1. As above, let S be the set of points (x,y) for which a S x < b and 
Osysf(x). Our next step is to look again at Figure 4.392 and express the 


a b x 1 x 


Figure 4.391 Figure 4.392 


cheerful opinion that the set S ought to have an area which we can denote by 
|S| and that the formula (1), which holds whenever f is a nonnegative step func- 
tion, ought to hold whenever f is a nonnegative integrable function. Our final 
step is to seek what a physicist could call experimental verification of this cheerful 
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opinion. Look at Figure 4.392 and note that S seems to fill up about one-third 
of the square having opposite vertices at the points (0,0) and (1,1), and hence 


that the area [S| of S should be about one-third. Now comes the calculation. 
Show that 


1 
1 
x? dx = %. 
h = 


5 Sketch a graph of the equation y = x + 1 over the interval 1 Sx <3 
and use elementary geometrical ideas to find the-area of the part of the plane 
bounded by this graph and the graphs of the equations x = 1, x = 3, andy = 0, 
Then evaluate the integral 


in (x + 1) dx 


and find out whether we obtain more experimental verification of the cheerful 
opinion of the preceding problem. 

6 Figures 4.393 and 4.394 show graphs of y = sin x and y = cos x over the 
interval 0 S x Sa. Observe that a particular region of Figure 4.393 seems to 


Figure 4.393 Figure 4.394 


fill about two-thirds of the enclosing rectangle and hence that the region ought 
to have area about 27/3. Then obtain the first of the formulas 


e w/2 
|p sine de = 2, J cos x dx = I, [ cos x dx = —l 
0 0 w/2 


and give an interpretation of the result. Then obtain the second and third 
formulas and interpret the results in terms of regions in Figure 4.394. 


7 Prove that if «v and v have continuous derivatives over the interval 
a S x < b, then 


J: u(x)o' (x) dæ = u(x)o(a) |’ — J: o(x)u’ (x) dx. 

Hint: Decide how the formula 
b b 
f Fe) de = F@ |. 


can be used. Remark: The formula to be proved is one of the most useful 
formulas in the calculus; it is the formula for integration by parts. 
8 Some of the most important applications of integrals involve inequalities, 
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and we look at an example. Let 4 be a positive number and start with the fact 
that 


(1) l Seset 


when 0 S x SA. Replace x by żin (1) and integrate over the interval 0 <i £ 
x to obtain, with the aid of Theorem 4.34, 


(2) xs lSets (0 Sx S4). 
Replace x by ż in (2) and integrate over the interval 0 <£ ¢ < x to obtain 

x? x? 
(3) zSsr-Uta seta (O <x <s £). 


Continue the process to obtain 


2 3 
® ase- (lteth) se5 
4 
(5) pee —(te+8+5) sak 


when 0 Sx S 4. Remark: Continuation of the process (with the aid of mathe- 
matical induction) shows that, for each positive integer n, 


x” xn xn 
(6) Sse —-(Qteth+he--- +8) sak 


While we now have so many other things to do that we shall not look at the details, 
we can observe that (6) provides a straightforward and foolproof way to obtain 
decimal approximations to ¢%, e, e?, etcetera, correct to 4 or 40 decimal places. 
We can discard much of the information in (6) by observing that x"/n! approaches 
0 as n— œ and hence that 


(7) = lim (I++ t5) 


n—> œ n! 


The formula (7) is the spectacular one, but (6) is often much more useful. We 
shall learn more about these things later. 
9 Applying the idea of the preceding problem to the inequality 


—1 < sinx $l, 


show that 
—x < l — cosx S x, 
2 2 
x x 
——~—Sx-—sinxs = 
2 2 


when x > 0. Remark: More extensive information will appear in a problem of 
Section 8.2. 
10 The Bernoulli functions Bo(x), B(x), Bo(x), - + > satisfy the conditions 


(1) Bo(x) = 
(2) Bi (x) = B,-1() (n = 1,2,3, . °) 
© h Bale) de = 0 (a = 1,2,3, °°) 
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over the interval ~œ < x < œ, except that (2) fails to hold when z is 1 or? 
and xis an integer. They all have period 1, that is, Ba(x + 1) = Ba(x) for each 
n and x. They are all continuous except that 
By(x), the saw-tooth function having the graph 
shown in Figure 4.395, is discontinuous at the 
integers. In fact, B(x) = 0 when xis an integer 
and 


Figure 4.395 (4) B(x) = x — [x] — # 


LAY IAL Os 
VAS VAR VAB\VAB AE) 


when x is not an integer. Show that (1) and (2) imply existence of constants 
Bo, Bi, B2, > + -+ such that, when 0 < x < 1 and 0! = 1 as usual, 


(5) Bo(x) = a, 

(5.1) By(x) = aa + at 

(5.2) B(x) = aor + ae + at 

(5.3) Bola) = BE 4 BE Sy ae 

(5.4) Bi(x) = A + as + aor + at ral 


and write two more of these formulas. Because of continuity, each of these 
formulas except (5.1) holds when 0 Sx <1. The numbers Bo, Bi, Bo, 0°: 
are the Bernoulli numbers and, when n 2 2, 


(6) B, = n'B,(0). 

Show that the above formulas can be put in the neater forms 
(7) O!Bo(x) = 

(7.1) WBi(~) = Box + Bi 

(7.2) 2!B2(x) = Box? + 2Bix + Bo 

(7.3) 31B3(x) = Box? + 3Bix? + 3Box + B; 

(7.4) 4!Ba(x) = Boxt + 4Bix3? + 6Box? + 4Bax + B; 


involving binomial coefficients and write two more of these equations. Use (3) 
to show that, when n 2 2, 


Ba(1) — By(0) = f, Bale) dx = fy Baa) dx = 0 


and hence 


Use (1) and (7) to show that Bo = 1. Then use (7.2) and (8) to show that 
Bı = —§. Then use (7.3) and (8) to show that B: = $. Then calculate one 
or two more Bernoulli numbers. Remark: Bernoulli functions and numbers 
have important applications and some people know very much about them. 


It can be shown that Bp = 1, B, = —4, Be = 4, Bs = 0, Ba = —v¥o, Bs = 0, 
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B; = ds, Br = 0, Bs = — y5, and that | Ban] is very large when n is large. Some 
books, particularly those that give a few formulas involving Bernoulli numbers 
but do not treat Bernoulli functions, use notation which conflicts with the nota- 
tion used above. 

11 Prove that if f is integrable over the interval 0 < x < 1, then 


o miS) - [fore 


k= 


Solution: To keep all of the bewitching mysticism of mysterious mathematics 
out of our solution, let e be a given positive number. Choose a positive number 
ô such that 


(2) Ly f(g) Axy — h t de | <e 


whenever the sum is a Riemann sum formed for a partition P of the interval 
0 £x S 1 for which |P| < ô. Let N be an integer for which N > 2 and N > 
1/6. Let n be an integer greater than N. Let P, be a partition of the interval 
0 <x S1 into n equal subintervals each having length 1/n. Then x, = k/n 
foreach k. Let xt = % so that xe = k/n for each k. Since Ax, = xp — X- = 
1/n for each k, we see that |P,| = 1/n. Since n > N, we have 1/n < 1/N and 
hence 1/n < 6. Therefore, |Pal < 6 and (2) holds when the sum is the Riemann 
sum formed for the partition Pa. But for the partition P, we have x; = k/n 
and Ax, = 1/n, so 


3 S Hat) Am = S JH S 5 
(3) PRGS x iG - A ; 
It follows that 


(4) <e 


PHO - f NO ae 


when n > N, and this gives the desired conclusion (1). 

12 The basic formula (1) of Problem 11 has numerous quite astonishing 
applications. Letting s be a nonnegative number and letting f(x) = x*, use the 
formula to prove that 


(1) im PEZI +o te 1 


n—> co ntl l+s 


Write the formulas to which this reduces when s = 0, $, 1, $, 2, and 3. Remark: 
Textbooks that specialize in proofs by mathematical induction give the formulas 


(2) 1 +2 +3 +o tn = MOTD 
(3) Hah Hepes fara MOTD En FD) 
(4) i4+723+384.-.-- +n = OT 
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With the aid of these formulas, it is easy to verify (1) for the cases in which ; 
is 1 and 2 and 3. In fact, Archimedes did it. The formulas of this problem 
have tremendous importance in the history of science because they stimulated 
interest in limits of sums that culminated in the invention of “‘the calculus” 
by Leibniz and Newton. 


13 Letting f(x) = (1 + *)*, where s is a constant for which s = —1, derive 
the formula 


lim (n+ 1)?t(im+2)?+ +--+ +t) _ att—il 


n— 0 net! stl 


Write the formulas to which this reduces when s has the values —2, 
1, and 2. 


14 Letting f(x) = (1 + x)7}, derive the formula 
1 1 1 


ati apt ta 5) = 1982 


15 Letting f(x) = 2x/(1 + x*)*, where s is a constant for which s # ~1, 
derive the formula 


1 1 
`y, 0, %) 


lim 


1 
n— œo „Fit 


. 2—2 1 2 3 a 
lim # | oa Fis Ge Dy E Gay 
n 1 1 
+ ray] = =H Lh 


Write the formulas to which this reduces when s has the values —2, —1, —3, 0, 
1 
gz, and 1. 


16 Letting f(x) = 2x/(1 + x), derive the formula 


. 1 2 3 n 1 
2 ——— + ————__ + —_—__. + o è o + —————— = — 
yim n E +12 nR 9 n?+ 3? n? + a] 2 log 2. 
17 Letting f(x) = IT d b ing the fact that f l dx =" 
ing f(x) = pays and borrowing the fac at J Tae =] 


derive the implausible formula 


. 1 1 1 1 T 
lim (apptapntapat +525) =F 


18 Persons are sometimes credited with substantial knowledge of calculus 
when they can simplify 


(1) 4f” e dt. 
dx Jo 


The problem can and should be solved by noticing that putting f(t) = e~* enables 
us to use the fundamental theorem of the calculus (Theorem 4.35) to obtain 


2 £ [TIO dt = fw). 
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When (2) holds and vu is a differentiable function of x, we can use the chain rule 
to obtain 


(3) 4 f oa [2 [moa] 
= fu) FE 


Use these ideas to show that (1) is 2xe™=‘, 
19 Letting 


F(x) = i e™ dt, 


use the ideas of the preceding problem to obtain a simple formula for F’(«). 
Then find F(x), that is, find a simpler expression for F(x), and differentiate it to 
obtain F’(x). Make the results agree. 

20 Prove that if f is continuous and u and v are differentiable, then 


d f” — du _ do 
5, [10 # = 1m 3-1 
Hint: Use the formula 


[O= [roa [roa 


and the ideas of Problem 18: 
21 Supposing that A is a positive constant, x > 0, and 


F(x) = f az l dt, 


show that F’(x) = 0 without use of the formula J T = log ż +c. 


4.4 Areas and integrals We all know what is meant by a rectangular 

region R having base length b and height h. When the x axis of a rec- 

tangular coordinate system is parallel to the base, R is the set of points 

(x,y) for which xo S x S xo + b and 

yo S y S yo+ h as in Figure 4.41. 

We are all familiar with the idea that yo+h F “a 
h 


il tile 
le 


R has an area and that this area is ili ait: 
bh, the product of the base length 
and the height. There is an old- 
fashioned view that this matter is Figure 4.41 
quite simple, but modern mathema- 

ticians, like modern atomic physicists, find that there is much to be 
learned about things that our ancestors thought were simple. It is 
quite absurd to presume that it is easy to prove that the area of R is bk; 
in fact it is quite absurd to presume that it is possible to prove that there 
is a number (bh or not) which is the area of R unless we have some defini- 
tions or postulates or something upon which proofs can be based. We 


r l 

si itil 
dys 
fit 
LES 
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escape this awkward situation with the aid of a definition designed for 
the purpose. 

Definition 4.42 If Ris a rectangular region having base length b and 
height h, then the product bh is called the area of R. This area is denoted} 
by the symbol |R| so that |R| = bh. 

This takes care of the matter of areas of rectangular regions, but we 
are not yet out of trouble. Let T be the triangular set consisting of those 
points within the rectangular region of Figure 4.41 which lie on and 
beneath the diagonal drawn there. When we try to decide whether there 
is a number which is the area of T, we find that we still need definitions 
or postulates or something before we can do anything. If we try to 
take care of triangular regions, circular disks, circular sectors, and sets 
of other special types by hordes of special definitions, we will find our- 
selves forever wallowing in confusion. While students in elementary 
calculus courses are normally not expected to know much if anything 
about the matter, we should at least know that our friend Lebesgue con- 
structed a theory of area which is usually called the theory of Lebesgue 

two-dimensional measure. ‘This eliminates the 

s confusion and is now very important in applied 

as well as in pure mathematics. We should not 

be injured and may possibly be benefited by a brief 

look at the Lebesgue theory. Let S be a set of 
Figure 4.43 points (x,y) which is contained in but does not 

completely fill a rectangle R. Figure 4.43 may be 
helpful, but may also be misleading because the set S need not look 
at all like the one shown in the figure. Let S’ be the set of points in R 
but not in S. 

Definition 4.44 The set S is said to have area (or two-dimensional 
Lebesgue measure) |S| if |S| is a number such that to each e > O there corre- 


spond (i) a countable collection Ri, Ro, Rs, ` > + of rectangular regions such 
that each point of S lies in at least one of these regions and 
(4.45) JRil + Ro] +--+ + (Ral < (S| +e (n =1,2,3,° °°) 


and (ii) another countable collection Rj, Ra, Rz, °° > of rectangular regions 
such that each point of S' lies in at least one of these regions and 


(4.46) |R| + IRI + ©- + IRI < |R| — |S| te (n = 1,2,3, © +°). 


t This notation accords with a general principle with which we are slowly becoming 
acquainted. If Qisa number ora partition or a point set or perhaps even an assertion or a 
crate of oranges, we expect |Q| to be a real nonnegative number which is associated with Q 
in some particular way and is, in some sense or other, a measure or a norm or a value of Q. 
The simplest useful example is that in which Q is a real number and |Q] is its absolute value. 
When applications of areas are involved, it is often necessary to recognize that h and & are 
numbers representing lengths measured in particular units (say centimeters) and that the 
area is a number of appropriate “square units” (say, square centimeters). 
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It is possible to describe complicated rules for constructing sets S for 
which no such number |S| exists, and we say that such sets do not possess 
areat (or are nonmeasurable). However, such sets are much more com- 
plicated than those that appear in this book. This discussion of areas 
will have served a purpose if it provides a reason for acceptance of the 
fact that the theory of area is much more complicated than the theory of 
Riemann integrals and that intuitive ideas about areas do not provide a 
satisfactory basis for proofs of theorems about Riemann integrals. We 
can, however, be reassured by the facts that many of the results of the 
theory of area are thoroughly simple and that they are in complete agree- 
ment with all of the results we shall obtain by use of Riemann integrals. 
We shall not use Riemann integrals to obtain illusory information about 
areas of sets that do not possess areas. More information about this 
matter will appear in Section 5.7. 

In what follows, we suppose that M, a, and b are constants and that 
fis a function, Riemann integrable over a S x < b, for which 0 < fæ) S 
Mwhena Sx Sb. LetS be the set of points (x,y) for whicha S x < b 
and0 Sy <S f(x). The set S may look more or less like the sets shown in 
Figures 4.471 and 4.472. In each case we can describe S as the set of 


b x b x 
Figure 4.471 Figure 4.472 


points or part of the plane or region bounded on the left and right by the 
graphs of the equations x = a and x = b and bounded below and above 
by the graphs of the equations y = 0 and y = f(x). In case f is con- 
tinuous and the graph of y = f(x) looks like that shown in Figure 4.472, 
we can comfortably describe S as the region bounded by the graphs of 
the four equations. Í 


t Newspapers and magazines keep us permanently aware of the fact that there are 
inadequacies in old-fashioned intuitive physical theories of matter and that these intuitive 
theories do not provide an adequate basis for modern physics. Since these newspapers and 
magazines keep us quite generally uninformed about theories of areas and volumes, it may 
be necessary to consult Appendix 2 at the end of this book to learn that there are bugs in 
intuitive theories of areas and volumes. 

t Of course climatologists who talk about areas of abundant rainfall, and philosophers 
who talk about areas of scientific thought, could be expected to call S the area bounded by 
the graphs. But in mathematics and perhaps even in climatology (we never know about 
philosophy) an area is always a number and scientists do not, in their most brilliant 
moments, call S an area. 
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When an enlightened scientist must calculate the area |S| of S, he writes 


(4.48) IS] = lim ) fw) Ax = J E) ax 


and, at least in simple cases, evaluates the integral with the aid of Theorem 
4.38. The procedure by which (4.48) is obtained must now be explained. 
The first step is to sketch an appropriate figure which will look more or 
less like Figure 4.471 or Figure 4.472. The next step is to make a parti- 
tion P of the interval a £ x < b into subintervals, but we do not bother 
to draw more than one of the subintervals. Without bothering with 
subscripts and stars, we let Ax denote the length of the interval and let x 
be a point of the interval. We then draw the rectangle whose width is 
Ax and whose height is f(x). The first step in building the formula 
(4.48) is to write f(x) Ax, because this is the area of the rectangle (or 
rectangular region). We then tell ourselves that this area is a good 
approximation to the area of the part of S that lies between the vertical 
sides of the rectangle, and, while this is no time to get excited about the 
matter, we could tell ourselves that the two areas might be exactly equal 
if we choose the x shrewdly enough. The next step is to add the area of 
the rectangle we have drawn to the areas of the other rectangles which we 
have not drawn to obtain Zf(x) Ax. Even if we did not know in advance 
that lim Zf(x) Ax exists, we should have a feeling that Zf(x) Ax should be 
near |S| whenever the numbers Ax are all small (that is, whenever the 
norm of P is near zero) and hence we should write 


(4.481) IS| = lim Zf(x) Ax. 


The final step is to recognize that the right side of this equation is the 
limit of Riemann sums and hence is the Riemann integral in (4.48). 
The ritual involving partitioning (or splitting up), estimating, summing 
(or adding), and taking a limit to obtain a Riemann integral equal to a 
number in which we are interested is known as “‘the process for setting up 
the integral.” The ability to “set up integrals” efficiently and correctly 
is very valuable, and problems in calculus textbooks that require the 
finding of areas are designed to promote abilities in this art. Students 
cannot know, unless they are told, that they are wasting their time if they 
never bother to set up integrals but only use remembered formulas to 
calculate areas and volumes and the ubiquitous moments of inertia. 


Problems 4.49 


I Figure 4.491 shows graphs of two equations y = f;(x) and y = fe(x) 
which intersect at the points (—2,—6), (0,0), and (2,6). The graphs bound 


4.4 Areas and integrals 239 


two regions Rı and R Use partitions and Riemann sums to obtain the 
formulas 


Rl = f,O de RA = [ G) — Ala) ae 
[Ri] + |R| = ia [file — fo(x)| dx. 


Remark: The widths and heights of rectangles are always positive, and mistakes 
in sign are undesirable. When hasty calculations indicate that an area or a 
population of a city is negative, the calculations 
should be examined. 

2 The graphs in Figure 4.491 are graphs of 
y = 3x and y=x3—-x. Find |Ri| + |R], this 
being the sum of the areas of the two regions 
bounded by the graphs. Ans.: 8. 

3 With Figure 4.492 to provide assistance, 
make a partition of the interval 0 £ x < 2 to ob- 
tain the area |S,| of the set Sı bounded by the 
graphs of y = 0, x = 2, and y = x2. Try to repair 


Figure 4.491 Figure 4.492 


the work if the result does not have reasonable agreement with an estimate made 
by counting squares and partial squares included by Sı. Then interchange the 
roles of x and y to find the area |S,| of the set S bounded by the graphs of x = 0, 
y = x? andy = 4. Makea partition of the interval 0 S y < 4and be sure that 
the correct integrand and limits of integration appear in the calculation 


[So] = lim X f(o) Ay = |" 0) ay. 


In this case also, try to repair the work if the result clashes with the result of 
counting squares. Finally, have another look at Figure 4.492 and see what 
IS:| + [S2] should be. _ 

4 Referring again to Figure 4.492, obtain |S2| by starting with a partition 
of the interval 0 < x < 2 and using an estimate of the area of the part of S: 
that stands above the interval of length Ax (or Ax) containing the point x (or xx). 
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5 Use the technique of the text to find the area of the triangular Patch 
bounded by the lines having the equations y = 2x, y = 0, and x = 3. Check 
your answer by use of elementary geometry. 

6 Let 4 be the area of the region bounded by the x axis and the graph of 
the equation y = x(1 — x). Sketch an appropriate graph showing a sample 
rectangle and fill in the details involving the formula 


A= lim ) (1 — x) Ax = in (x — x?) dx = =. 


7 Find the area of the region bounded by the coordinate axes and the graph 
of the equation y = x3 — 8. Ans.: 12. 
8 Find the area of the part of the plane bounded by the graphs of the equa- 
tions y = x? — 3x andy = x. Ans.: 8. 
9 Find the area of the region bounded by the graphs of the equations y = x, 
y = 2x, and y = x% Ans.: $. 
10 Find the area of the region in the first quadrant bounded by the x azis 
and the graphs of the equations y = x andy = 2 — x2. Ans.: (8 +/2 — 7)/6. 
11 Let 4 be the area of the part of the plane which lies between the lines 
having the equations x = 7 and x = 2m and is bounded by the x axis and the 
graph of the equation y = sin x. Sketch an appropriate graph showing a sample 
rectangle and, observing that the height of the rectangle is the positive number 
— sin x (not the negative number sin x), fill in the details involving the formula 


4 = lim ) (- sin x) Ax = — [7 sin x de = 2, 


12 Someday we will be able to show that the graph of the equation x% + y» 
= a% is, when ais a positive constant, a part of a parabola. Find the area of the 
region bounded by the graph and the coordinate axes. Ans.: a?/6. 

13 Is the area of the region bounded by the graphs of the equations 


y = x3 4+ x, y= x3 +1 
the same as the area of the region bounded by the graphs of the equations 
y= x?, y= 1? 


14 The graph of each of the following equations contains a loop; determine 
the nature of the graph and find the area of the region bounded by the loop, it 
being assumed that a is a positive constant. 


(a) y? = x(a — x)? Ans.: 1ga” 
(b) y? = x(x — a)? Ans.: Tga” 


15 The graphs of the equations y = $x? and y = x + 4 bound a region R. 
With the aid of a reasonably good figure, make an estimate of the area |R] of R. 
Then find |R| by making partitions of an appropriate part of the x axis so that 
vertical strips appear in the calculation. Then find |R| by a method in which 
horizontal strips appear. Make the results agree with each other and use your 
estimate to provide assurance that the two answers are reasonable. 
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16 Let £ be the area of the circular disk of 
radius a shown in Figure 4.493. Explain the 
ideas associated with the calculation 


A = lim X 2rx Ax = 2r EZ: = 7a’, 


Hini: Think of the ring between the two inner 
circles as being a ribbon of width Ax and length 
2rx, the length being (by definition of r) the cir- Figure 4.493 
cumference of a circle of radius x. 

17 Sketch graphs of sin x and cos x over the interval 0 < x Sa and then, 
with the aid of this information, sketch graphs of sin? x and cos? x. Use these 
graphs to obtain a reason why it should be true that 


(1) Ir sin? x dx = in cos? x dx. 
Note also that 
(2) IN (sin? x + cos? x) dx = JN ldax=r. 


What can we now conclude about the integrals in (1)? Taking a totally different 
tack, use the formulas 


. 1 - 2 
3) inta = LTS, cage y = Lt 8 2e 


to evaluate the integrals in (1). Make all of the results agree. 
18 Prove the formula 


by observing that the integrand is nonnegative and constructing a region of 
which the integral is the area. Hint: Let y = ya? — x? and, after tinkering 
with this equation, draw an appropriate figure. 

19 Let a and b be constants for which O0 <b <a. Show that if y 2 0, 
0x S a, and 


xg? 2 
then 
y= Vane 


Let S be the set of points inside the graph of (1); as we shall learn later, the graph 
is an ellipse. With the aid of Figure 4.494 show 


that Figure 4.494 


ci 
SN l i 


With the aid of the preceding problem, show that 
|S] = rab. This is a result that many people 
remember: the area of a circular disk is maa and 
the area of an elliptic disk is mab. 
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20 This problem is interesting because it shows how a basic formula involving 
areas (a well-known formula which we have not yet proved) can be used to obtain 
preliminary derivations of formulas involving trigonometric and inverse trigo- 
nometric functions. Rigorous derivations will be given in Chapter 8. Supposing 
that 0 < ż < a, construct and look at an appropriate figure to derive the formula 


t m . t 
(1) f «a? — x? dx = ft Va? — 2 + ga? sin”! - 
a 


in which the first term on the right is the area of a particular triangle and the 
last term is the area of a circular sector having radius a and central angle 9, 
where 0 = sin™! (t/a) and O < 0 < 2/2. Anyone who is short on information 
about areas of circular sectors is reminded that the area of a sector having central 
angle 0 is, as it ought to be, the product of 6/2m and the area ma? of the whole 
circle. We can suddenly become interested in (1) if we realize that we have 
theorems and rules that enable us to write formulas for the derivatives with 
respect to f of everything in it except the last term and hence that we can obtain 


a formula for the derivative of the last term. To capitalize this idea, put (1) in 
the form 


f p 
(2) sat = iS VET ade -i VETE] 


and then differentiate and simplify results to obtain the formula 


, daat 
(3) a?” a \/ a? — :? 


Remark: We invest a moment to look at the formula 


d.. 1 
(4) aint = SR 


to which (3) reduces when a = 1. At least in the case where 0 < ż < 1 and 
0 < 0 < 27/2, trigonometry books emphasize the fact that the angle 6 of Figure 
4.495 is “the angle whose sine is 2” or “the inverse sine of #,” so that 8 = sin! ż 


> 
Figure 4.495 Figure 4.496 


and = sin 6. The graph in Figure 4.496 shows how 6 and are related. The 
relation (4) is equivalent to the relation 


(5) lim 49 = —1L_ 
At—0 At V 1 — t? 
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and this is equivalent to the relation 


At —— 
16) lim 79 = Vi-# = V1 — sint 6 = cos 6 


A6—0 


or to the first of the formulas 
d , d 
(7) 76 30 6 = cos 6, zg °S 6 = — sin 6. 
The second follows from the first and the calculation 
d d . {T 
(8) F008 8 = gz sin (3 — 8) = — cos (3 — 0) = — sing, 


21 Show how formulas of the preceding problem can be used to obtain the 
integration formulas 


1 ; x% 
[ VE ax = hy JP a + dat sin 4 ¢ 
a 


1 . X 
Era te 


Then keep in contact with the external world by finding these formulas in your 
book of tables. 

22 Sketch a few figures which illustrate applications of the following fact. 
If f is integrable (and hence bounded) over a S x S b, we can choose a positive 
constant B such that f(x) + B > 0 when a S$ x < b and write the formula 


EG dx = f [f(~) + B] dx — fB dx, 


b 
which shows that J f(x) dx is the result of subtracting the area of a recrangle 
Q 


from the area of the set of points (x,y) for which a S x S band —B Sy < f(x). 
Remark: This problem and the next provide ways of reducing questions involving 
integrals to questions involving integrals with nonnegative integrands. 

23 Sketch a few figures which illustrate applications of the following fact. 
If f is integrable (and hence bounded) over a S x S b, so also are the functions 
g and k defined by 


g(x) 


OERA ag) 


_ [f@) = fe) 
2 


Moreover, g(x) 2 0, h(x) 2 0, f(x) = g(x) — h(x), |J) = e6) + h(x), and 


[i dx = J: g(x) dx — J: h(x) dx 
J. [f(x)| dx = f: g(x) dx + J. h(x) dx. 
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24 If f(x) = |x|, then f'(x) = sgn x except when x =0. Whena<0< b, 
Theorem 4.37 does not guarantee correctness of the formula 


b 
[? sen x dx = Ia | = |è] — lal, 
a a 


but the formula may be correct anyway. What are the facts? Ans.: The 
formula is correct. 
25 This remark is dedicated to a distinguished professor in a distinguished 
university in New Jersey. He claimed that it does not make sense to ask a 
2 . . 
student to evaluate the integral Í x3 dx. The man was right. The integral 


is a number, the limit of Riemann sums, and the number is 4. Thus, the man 
was insisting that it does not make sense to ask a student to evaluate 4. What 


, 2 
the foxy professor really wanted to do was to emphasize the fact that I, x3 dx 


is something more than some black ink on white paper. Itis a number. There 
are times when the thing is called a symbol, but it is not a symbol. The fact 


that fo x’ ds |" = 16 would be hard to explain if the thing were considered 


to be a symbol because we do not square symbols to get 16; we square numbers to 
get 16. We must agree that we should know what we are doing when we are 


2 
asked to “evaluate” f x3 dx and then go to work to find that the ‘‘answer’ ı3 


4. A few thoughts about these matters may even pay off sometime. 


4.5 Volumes and integrals It could hardly be expected that funda- 
mental ideas and definitions involving volumes of sets in £3 could be 
simpler than the corresponding ideas and definitions involving areas of 
sets in Ho. In the best treatments of the subject, the volume of a set is 
its three-dimensional Lebesgue measure. The theory begins modestly 
with the definition which asserts that the volume V of a rectangular 
parallepiped (or brick or three-dimensional interval) having length a, 
width b, and height ¢ is the product of the dimensions, so that V = abe. 
In the theory of volumes, bricks play the same role that rectangles play 
in the theory of areas of sets in E. It turns out that each bounded set 
in Æ; that we shall dream of considering has associated with it a number 
which is the volume of the set. If two of our sets S, and Sz are such that 
Sı is a subset of Se, which means that each point of Sı is also a point of 
S2, we can be sure that the volume |S,| of Sı is less than or equal to the 
volume |Sə| of Sə} If a set S is composed of two parts Sı and Sz which 
have no points in common, we can be sure that |S| = |S,| + |S2|. If one 
of our sets Sı has a volume |S,| and if Sz is another set congruent to Sı, 
then S2 has a volume and |S2| = |S,|.. Appendix 2 at the end of this book 
shows that the theory of volumes is (like the theory of ‘‘solid’’ physical 
matter) not as simple as the naive believe. While a full discussion of 
volumes lies far beyond the scope of this book, the theory of Lebesgue 
measure in Æ; justifies all of the methods we shall use for finding volumes. 
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We now illustrate the “‘slab method” for finding volumes of three- 
dimensional sets that are commonly called “solids.” With the expecta- 
tion that the method will be fully 
understood and applied to find vol- 
umes of other solids, we find the vol- 
ume of the solid cone of Figure 4.51 
which consists of the points in Æ; 
lying between the planes x = 0 and 
x = h and inside or on the conical Figure 4.51 
surface. When we are not required 
to explain the details of the method, we solve this problem in two lines 
by writing 


(4.52) y = lim > A(x) dx = f AG) dx 
h fb XÑ mb? x3]? 
-fG = 325], - 3 


Even when we are not required to give explanations to someone else, we 
do not write this without talking to ourselves. We make a partition P 
of the interval 0 < x <S h, but we draw only one subinterval having 
length Ax and let x be a point of the subinterval. Planes perpendicular 
to the x axis at the ends of the interval have between them a part of the 
solid that we can call a slab. Let 4(x) be the area of the section in which 
the solid intersects the plane which contains the point we have selected 
and is perpendicular to the x axis. In case |P| is small, the number 
A(x) Ax is exactly equal to the volume of our slab or is a good approxi- 
mation to it. We next write 


(4.53) DA (x) Ax 


and tell ourselves that this is either exactly or approximately the sum of 
the volumes of the slabs and hence is exactly or approximately V. Hence 
it should be true that 


(4.54) V = lim DA(x) Ax. 


But the right side of this formula is the integral in (4.52). Our next step 
is to observe that A(x) is the area of a circular disk whose radius y is 


such that y/x = b/h and hence y = (b/h)x. Thus 


_ _fbe\? _ rh? , 
(4.55) Ae) = (FY = 5 a 
With this information, we can quickly complete the work in (4.52). 
Observe that it would not be easy to find the volume of the solid cone of 
Figure 4.51 by employing slabs resulting from a partition of the interval 
—b S y < b of the y axis. The difficulty resides in the fact that planes 
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perpendicular to the y axis intersect the solid in plane sets the areas of 
which are not easily found. 

Finally, we illustrate the “cylindrical shell method” for finding volumes 
of solids by finding the volume of a solid cone in another way. We 
consider the solid cone to be the solid obtained by rotating, about the y 
axis, the triangular region T in which the solid cone intersects the first 

quadrant of the xy plane. This re- 

y a gion appears in Figure 4.56. This 

b time we make a partition P of the 
interval OS y Sb of the y azis. 
6— When yrk-ı S yi S yx, the lines in 

x the xy plane having the equations 
y = Yk—ı and y = yz, cut from Ta 
— strip approximating a rectangular 
Figure 4.56 region of length [k — (h/b)y#] and 
width Ay,. When this rectangular 
region is rotated about the x axis, it generates a cylindrical shell resem- 
bling a tin tomato can from which both top and bottom have been 
removed. Different points in this shell have different distances from the 
x axis, but when |P|, the norm of P, is small, these distances are all 
nearly yf. Taking 2ry# to be the circumference of the shell, we use the 
number 


(4.57) omy? (: — ; vt) 


to approximate the area of the shell. Multiplying this by Ayx, the 
thickness of the shell, gives an approximation to the volume of the shell. 
This leads us to the formulas 


(4.571) y = lim ` 2ryž (: — fot) Ayk 
and 
b 1 
(4.58) y = anh | y (1 ~ zy) dy = 4nb*h. 
0 


For finding volumes of cones, the slab method produces answers much 
more easily and quickly than the cylindrical shell method. Most of the 
problems at the end of this section should be solved by the slab method, 
but the cylindrical shell method sometimes works better than the slab 


method. 


Problems 4.59 


I Find the volume of the spherical solid (or ball) of radius a which has its 
center at the origin. Find out whether it is easier to partition the whole inter- 
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yal —a Sx £ aor to take double the result of partitioning the interval 0 < x < 
a. Remark: Scientists should always remember that the volume is 7a’. 

2 Supposing that 0 Sh < 2r, find the volume V of the segment of the 
spherical ball with center at the origin and radius r which lies between the planes 
having the equations x =r—handx=r. Ans.: 


Van [" (2 — x) de = brhGr — h). 


3 The region in the first quadrant bounded by the graphs of the equations 
y = kx, x = 0, and y = 4 is rotated about the y axis to produce a solid S which 
is a part of a solid paraboloid like the nose of a bullet. Show that |S], the volume 
of S, is exactly half the volume of a solid circular cylinder having the same base 
and altitude. 

4 The region bounded by the graphs of the equations y = kx? and y = 4 
is rotated about the line having the equation y = 4. Find the volume of the 


resulting solid. Ans.: 
1674? A 
15 k` 


5 A region R is bounded by the graphs of the equations xy = 1, y = 0, 
x =a, and x = b for which O<a< b. Find the volume |S| of the solid S 
obtained by rotating R about the x axis. Ans.: r/a — m/b. 

6 The region bounded by the graphs of the equations x = 1, x = 2, y = 0, 
and y = %4 V9 — x? is rotated about the x axis. Find the volume of the result- 
ing solid. Ans.: 800/27. 

7 The region bounded by the line and hyperbola having the equations 
x+y = 5 and xy = 4 is rotated about the y axis. Find the volume V of the 
solid thus generated. Ans.: 9r. 

8 Leta cylindrical shell (which resembles the part of a tomato can remaining 
after the top and bottom have been removed) have length L and have inner and 
outer radii rz, and rẹ. Supposing as usual that Ar, = rk — rx-1, prove that 


the volume of the shell is 
y 
(rri \L Arr, cls 


where rz is the number defined by rp = 4(r,-1 + Th). 
9 Set up two different integrals for the volume 
of the solid torus (or anchor ring) obtained by 
rotating the circular disk of Figure 4.591 about 
the y axis. First make a partition of the interval 
OS ya of the y axis and estimate volumes of 
washers (things normally associated with nuts and Figure 4.591 
bolts). Then make a partition of the interval 
b—-asSx<Zb+ a4 and estimate volumes of cylindrical shells (things which, if 
they had tops and bottoms, would be tin cans). Evaluate one of the integrals. 
Remark: The correct answer agrees with the result of applying a famous old 
theorem which says that the volume of the solid of revolution is the product of 
the area of the set rotated and the distance the centroid (in this case, the center) 
goes. The theorem is the theorem of Pappus. 
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10 Find, in two or three different ways, the volume of the solid obtained by 
replacing the disk of the preceding problem by the square with horizontal and 
vertical sides tangent to the disk. One of the methods is suggested by the remark 
at the end of the preceding problem. 

11 Find the volume of the solid obtained by rotating, about the y axis, the 
region bounded by the graphs of the equations y = 3x? and y = 12. Ans: 
24r. 

12 Find the volume of the solid generated by rotating, about the x axis, 
the region in the first quadrant bounded by the graphs of the equations y = 3 
x =0, andy =8. Ans.: Hr. 

13 Let a>0. Two circular cylinders of radius a have their axes on the 
x and y axes. With axes so oriented that the z axis is vertical, sketch the part 
of the first cylinder which lies in the first octant and between the planes x = 0) 
and x = 5a. Then sketch the part of the second cylinder which lies in the first 
octant and between the planes y = 0 andy = 2g, 
For three values of z, sketch the lines in which a 
horizontal plane through (0,0,z) intersects the parts 
of the cylinders in the figure, and then sketch the 


curve in which the parts of the cylinders intersect. 
7 If the figure is reasonably good, it should be easy 


2 


BEN to find the volume V of the solid bounded by the 

three coordinate planes and parts of the two cylin- 
An ders. Doit. Ans.: Figure 4.592 and F = 243/3. 
Figure 4.592 14 Find a reason why the answer to the preced- 


ing problem must be less than a’. 

15 A cylindrical hole is drilled through the center of a spherical ball. It is 
observed that the length of the hole is L. Show that the volume of the part of the 
ball remaining is the same as the volume of a spherical ball of diameter L. 

16 A section of a tree trunk is a section of a right circular cylinder of radius a. 
A wedge is removed by making two cuts to a diameter (line, not number), one 
cut being in a horizontal plane and the other being in a plane which makes the 
angle @ with the horizontal plane. Find the volume of the wedge. 

Ans.: ĝa? tan 6. 

17 It is of interest to know that our methods are powerful enough to enable 
us to derive the standard formula 


(1) V = $rabc 


for the volume V of the solid in E; bounded by the ellipsoid having the equation 
x? y g 
(2) ztptazl 


in which a, b, ¢ are positive constants. The formula for the volume can be 
remembered with the aid of the fact that if a = b = ¢ = r, then (2) is the (or an) 
equation of a sphere of radius r and (1) gives the volume of the ball which it 
bounds. To start to find the volume of the part of our solid containing points 
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(x,y,z) for which y 2 0, we make a partition of the interval 0 Sy 3b). When 
0 < y < b, we can put (2) in the form 
x? zo | 
(3) aTa = pE- y’) 


and hence in the form 


(4) 


x? y? 
7a. pet tz Zna EL. 
G Ge) 
When y has the constant value yf, (4) has the form 


(5) gtRal, 


where 
6) Amo VB-y, BEVA of 


This shows that, as Figure 4.593 indicates, the plane having the equation y = yz 
intersects our solid in an elliptic disk 
which, according to Problem 19 of 
Section 4.4, has area 74B or 


Tac *2 
) 


(7) — (b? — yk 


The volume of the slab of our solid 
which lies between the planes having Figure 4.593 
the equations y = yz_) and y = 4, is 

then exactly or approximately the result of multiplying (7) by Ay,. Thus 


(8) y =2 lim ` = (b2? — y*) Ay, 


the factor 2 being required because we partitioned only the interval 0 S y 
The limit of Riemann sums being a Riemann integral, we obtain 


IA 
~ 


0) y = ae 


[e-a 


and hence (1). In case two of the three numbers a, b, ¢ are equal, say a = c, 
the graph of (2) is called a spheroid. When finding the volume of the solid 
bounded by a spheroid, it is possible to simplify matters by using circular disks 
instead of elliptic disks. Some scientists consider it to be more fun to work out 
the above formulas than to remember that a spheroid for which a = ¢ < b 
is called a prolate spheroid (like the surface of a cucumber or a watermelon) and 
that a spheroid for which a = c > b is an oblate spheroid (like the surface of a 
pancake or an unscarred earth that bulges at its equator and is flattened at its 
poles because of its rotation). 

18 From time to time, we recognize the fact that some scientific terminologies 
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and notations have their historical origins in primitive ideas that are fuzzy or 
incorrect. The number in the right member of the formula 


(1) lim > G) Ax = f r dx 


is, when it exists, defined in terms of Riemann sums in a way which we must 
now understand. If (1) holds, then to each positive number e there corresponds 
a positive number ô such that 


(2) $ fe) Ax, — J t dx| <€ 


k=l 


whenever P is a partition of the interval a S x < b for which |P| < 6. Fora 
long time before this precise idea of Riemann revolutionized (or counter-revolu- 
tionized) mathematics, it was generally considered to be meaningful to regard 


e e a >) b 
the limit of sums as “the sum of infinitely many infinitesimals.” Thus Í F(x) dx 
a 


was considered to be an “infinite sum” of products of “finite” numbers f(x) and 
“infinitesimal” numbers dx. The “reasoning” involved is quite as flimsy and 
unrewarding as the “reasoning” which reaches the ‘‘conclusion” that “a circle 
is a polygon having infinitely many infinitesimal sides because it is a limit of 
polygons.” In mathematics, as in other sciences, many of our ancestors were 
intrigued by ideas which are now considered to be obsolete. Nowadays we accept 
the idea that the sum of the volumes of many thin slabs can be a good approxima- 
tion to the volume of a spherical ball, but we reject the fuzzy idea that the 
volume of the ball is the sum of the volumes of infinitely many infinitely thin 
slabs. It is not easy for historians to decide which of our great ancestors really 
had quite correct ideas about approximations and limits and, without swallowing 
ideas about sums of infinitesimals, merely used the fuzzy terminology because it 
was the fashion to do so. There can be tenuous connections between ideas and 
words. If Leonhard Euler wrote in a language in which apples were called 
“potatoes that grow in the air,” historians unaware of the fact have an oppor- 
tunity to conclude that this intellectual giant did not know the difference between 
potatoes and apples. Some people believe that the notation for integrals is bad 
because it makes too many people think that the dx is a number. The author 
believes that terminologies and notations involving limits are the real sinners 
because they make too many people think that numbers and partitions and other 


things are mobile. Perhaps replacing “lim” by “approx” in (1) would cure many 
of our ills. 


4.6 Riemann-Cauchy integrals and work This section introduces 
integrals that are, in some cases, not Riemann integrals but are con- 
structed from Riemann integrals by use of ideas that were made precise 
by the French mathematician Cauchy (1789-1857). It may happen that 
the integral in the right member of the formula 


co . h 
(4.61) [T f@) dx = lim [, Fæ) ax 
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exists as a Riemann integral whenever h 2 a and that this integral, as a 
function of h, has a limit as h becomes infinite. In such cases, this limit 
is the Reemann-Cauchy integral of f over the semi-infinite interval x = a 
and is denoted by the symbol in the left member of (4.61). In each other 
case, we say that the integral in the left member of (4.61) does not exist 
as a Riemann-Cauchy integral. For example, when r > 0, 


h _ 
(4.611) in pide = lim [Yatde = tim Z |! = tim 1_1 — 1. 
r h— œ —1 r hoo | 7 h r 


We can bravely start to calculate a Riemann-Cauchy integral by 
tentatively writing 


0 . h 
(4.612) h cos x dx = lim [, cos x dx = lim | sin an = lim sinh 


h— œ h— œo 


with the understanding that we will get an answer if the last limit exists. 
The last limit does not exist, however, so the integral does not exist. 
Riemann-Cauchy integrals of another type are defined by the formula 


(4.62) JEE) de = lim JERE) ax 


when a > 0 and the integrals and limit exist. Consider the example for 
which f(x) = x~% when x > 0, while f(x) is either undefined or is defined 
in some other way when x < 0. Then fis not bounded over the interval 


1 
0 <x $1 and hence J, f(x) dx cannot exist as a Riemann integral. 


However, 


(4.621) h x dx 


I 
z: 

L 
as 
> 


li 
z 
r— 
Zo oT 
x R&R 
Ly 
| 


= lim |2 -2 vi] = 


h=>0+ 


so the first integral exists as a Riemann-Cauchy integral. Riemann- 
Cauchy integrals of still other types are defined by the formulas 


(4.622) f? f@) dx = lim fa fle) dx 
JE FO dx im JER) dx 


when the integrals and limits exist. Finally, the Riemann-Cauchy 
integrals in the left members of the formulas 


(4.623) JESE de = [E EO de + fi Fe) de 
(4.624) f: fle) de = f° f(x) dx + im fx) dx 
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are defined by these formulas whenever the integrals on the right exist as 
Riemann-Cauchy integrals. Perhaps attention should be called to the 
fact that some elementary books reserve the term “definite integral” 
for application to an integral of a particular brand (which is sometimes 
the Riemann brand and is sometimes not care. 
y fully delineated) and apply the term “improper 

integral”? to each integral of another kind. 
It is impossible to have a tranquil scientific 
career without thorough understanding of 

matters relating to 


1 1 
(4.63) | =a aX. 


The graph of the integrand is shown in Figure 
-2 -1 0 1 2 * 4.631. The integral cannot exist as a Riemann 
Figure 4.631 integral because the integrand x~? is undefined 


when x = 0. Even if we set f(0) = 0 and 
f(x) = x? when x =Æ 0, the integral J i f(x) dx will still fail to exist as 


a Riemann integral because f is not bounded over the interval] 
—1 <x <1. According to (4.624), the formula 


lj I, ‘1s 
(4.632) [3 E- nt [Ss x 


will be valid when the integrals are Riemann-Cauchy integrals provided 
the two integrals on the right side exist. The calculation 


(4.633) T l Js= lim [Cae = lim z- 1| = æ 
—1 x? hot J-1 X? r0+ Lh 


shows that the first integral on the right does not exist, and the calculation 


1 
(4.634) [ae = lim ie ~ dx = lim z — 1| = © 
0 h0+ r0+ | A 


shows that the second does not exist. Hence, the integrals in (4.632) do 
not even exist as Riemann-Cauchy integrals. The calculations do, how- 
ever, enable us to convey information by writing 


1 1 0 1 1] 
(4.635) J ar-f het] za dx = o + œo = 0 


Persons do not lead these tranquil scientific lives when they realize that 


(4.636) d-i_i 


ax x x 
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except when x = Q and cheerfully make the calculation 


1 1 1 
(4.637) [se = =f, = —2 (2???) 


which would be correct if (4.636) were valid over the whole interval 
—1 <1. Since the wide world contains many definitions of inte- 
grals in addition to those of Riemann and Riemann-Cauchy, it is some- 
what presumptuous to assert that (4.637) is ridiculous. However, when 
we confine our attention to Riemann and Riemann-Cauchy integrals, 
we can observe that (4.637) is incorrect. 

Integrals of the types in (4.61) and (4.623) are particularly useful. 
For example, the formula 


1 (e) _(z—M)? 
4.64 7a | e 202 dx = 1 0 
( ) / On > x (e > 0) 
is not easily proved, but it lies at the foundation of very much work in 
probability and statistics. Proof of this formula will appear later. 

We conclude this section with a discussion of work in which the 
formula 


(4.65) i x=} (a > 0) 


— o 


plays a fundamental role. To begin, we study the amount of work done 
by a force F which pulls a particle P from the place on an x axis where 
x = a to the place where x = b. The force F may have the direction of 
the x axis but, as in Figure 4.651, this 

is not necessarily so. Let f(x) denote F 

the scalar component of the force F in Oo goo 
the direction of the motion, that is, in 9 a Ax b x% 
the direction of the x axis. In case f(x) Figure 4.651 
is a constant, measured in pounds (or 

dynes), and the distance b — a is measured in feet (or centimeters), 
the work W done by the force is measured in foot-pounds (or dyne- 
centimeters) and is defined by the formula 


(4.652) W = f(x)(6 — a). 


Since work and distance are scalars and force is a vector, it is quite 
incorrect to perpetuate the ancient idea that “work is force times dis- 
tance”; we must use scalar components of forces. In case the scalar 
component f(x) is different for different numbers x, the definition (4.652) 
is inapplicable and we need integration to calculate W. The procedure 
is almost identical with the procedure used to calculate areas and vol- 
umes. We make a partition P of the interval from a to b with a “small” 
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norm |P| and write f(«*) Ax, as an approximation to the amount of work 
done in pulling the particle from the left end to the right end of the kth 
subinterval. The sum 


(4.653) Xf (xe) Axx 


should then be a good approximation to our answer W and hence we 
should have 


(4.66) | W= lim X flx#) Axr = f: fx) de. 

Our statements about (4.653) and (4.66) were necessarily vague and 
optimistic because the quantity W that we are trying to calculate has not 
yet been defined. We must recognize the fact that we cannot prove 
correctness of a formula for W when we have no definition or other 
information that tells us what W is. In the absence of another definition 
or other information, we must adopt the principle that our work with 
partitions and Riemann sums provides the motivation for the definition 
whereby W is defined by the formula 


(4.661) W = f; fla) dx 


whenever f is a function for which the integral exists as a Riemann integral. 
The above ideas will now be applied to basic problems. The Newton 
(1642-1727) law of universal gravitation says that if two particles of mass 
mı and me are concentrated at distinct points P, and Pe, then these 
particles attract each other with a force whose magnitude is proportional 
to the product of the masses and inversely proportional to the square of 
the distance between them. Suppose we have a particle of mass m, 
concentrated permanently at the 
mo o mrl __œ—— origin, Figure 4.67, amd that we 
o a x b «æ have a “test particle” of unit mass 
Figure 4.67 that we wish to move along the 
positive x axis. There is then a 
constant k, which depends only upon the units used to measure mass, 
force, and distance, such that the force on the test particle has magnitude 
kmı/x? when the particle is at distance x from the origin. The work Wa» 
required to move the particle from the point a (that is, the point with 
coordinate a) to the point b is then to be calculated from the formula 


b 
(4.671) Was = Í Era dx. 
From this we find the remarkably simple formula 


(4.672) W, = Pm km, 
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It follows from these formulas that 


(4.673) lim Was, = i em dx km, 
b— œ a 

This formula is responsible for some terminology that scientists often use. 
The limit in (4.673) is called “the amount of work required to take the 
test particle from a to infinity”? and this amount of work is called the 
potential (or gravitational potential) at the point a due to the particle of 
mass m, at the origin. [Itis an easy consequence of these definitions and 
formulas that the potential, say u, at the point P (x,y,z) due to a particle 
of mass m, concentrated at the point Polxo,Yo,Z0) is 


km, , 
V (x — xo)? + (y — yo)? + (z — 20)? 


The basic importance of the concept of potential u lies in the fact that if a 
particle of mass m is moved from a point P, to a point Pz with no forces 
upon it except gravitational forces and a force F, and if the speeds at 
P, and Pz are equal, then the work done by the force F is equal to the 
product of m and the potential diference, that is, the potential at the 
starting point P; minus the potential at the destination Pp». 

All of the above ideas and formulas apply to electrostatic potentials as 
well as to gravitational potentials. In the electrical case, we start with 
two charges qı and ge and apply the Coulomb (1736-1806) law 
IF| = 2g19¢2/x?, which is the electrical analogue of the Newton law of 
gravitation. 


(4.68) u = 


Problems 4.69 


1 Suppose somebody writes 


1] | 
J -dx = ©, J =- dx = 0 
0 x 1 x 
with the hope that he is conveying information to you. What does he mean? 
Ans.: 


e 11 e hj 
lim -dx = œ, lim dx = ©, 
hRo-o+ Jh X joao J] X 
2 Prove that 
o% I 1 11 
(2) [Se- [se -1 ($ <1) 
1 gp OX = Ms o xT Ih 
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4 Remembering that e = 2.71828, ang 

emembering or learning that e? is about 20, ¿$ 

Py | rae | about 400, and ° is about 8000, make s 

Aoma calculations to indicate that Figure 4691 
1 2 3 ‘ shows the nature of the graph of y = e, 

Figure 4.691 Observe that the area of the shaded region 

seems to be about the same as the area of the unit square. What are the facts? 

5 The region bounded by the cissoid having the equation 


2 x 
Y T Za—x 
and its asymptote is rotated about the asymptote. Using the cylindrical shell 
method, set up an integral for the volume V of the solid thus generated. Clue 
and ans.: 


V = 2 lim ) 2ra — x)y Ax 
y = 4r h 72 (2a — x)¥ dx. 


Remark: With the aid of information about beta integrals, it can be shown very 
quickly that V = 27a’. 
6 Show that putting M =0 and o = 1/+/2 in (4.64) gives the formula 


J Da e? dy = \/T. 


Sketch a graph of y = e77? which is good enough to show that this result seems 
to be correct. 

7 Prove that if f(x) = 4 when 0 <x < 1 and f(x) = 5 when 1 <x <2, 
then 


IEO dx = 9. 


Note that f(x) is undefined when x = 0, when x = 1, and when x = 2. 
8 Prove that 


. h . h 
lim x dx = 0, lim x dx = w, 
haw J h= 0 


9 Even persons having little contact with the external physical world know 
that rods and wires and springs stretch when they are pulled and that the amount 
of stretching depends in some way upon the amount of pulling. Engineers have 
understanding of elastic limits and of circumstances under which useful results 
are obtained by applying the law of Robert Hooke. The Hooke law says that 
the magnitude of the force required to stretch a rod of natural length L to length 
L + x is 


ao) 


m~i aw 


where & is a constant that depends upon the rod. The number x is the elongation 
of the rod, and the magnitude of the force is proportional to the elongation. 
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Figure 4.692, which shows the rod before and after stretching, may be helpful. 
Supposing that 0 < a < b, find the work done in stretching the rod from length 
L+ atolength L +b. Ans.: 


L3 (b? — a?). 


10 AÀ conical container (see Figure 4.693) has height 
a feet and base radius R feet. It is filled with sub- 
stance (water or wheat, for example) which weighs w 


a z 
moree 


Figure 4.692 Figure 4.693 


pounds per cubic foot and which must be elevated (by a pump or shovel or other 
elevator) to a level H feet above the vertex. Suppose that H 2a. Find the 
work W required to accomplish the task. Hint: Start by making a partition of 
the interval 0 S y S a and calculating an approximation to the work required 
to lift the material which constitutes a horizontal sheet or slab. All calculations 
are based upon the fundamental idea that gravity pulls things downward, and that 
the magnitude of the force on a thing is its weight. Ans.: 


Note that if V is the volume of the conical solid, then the answer can be put in 
the form X = wV(H — ła). 

11 Modify Problem 10 by replacing the conical container by a container 
such that, for each y* for which 0 < y* < a, the plane having the equation y = y* 
intersects the contents of the container in a set having area A(y*). Then set 
up an integral for the work W. Ans.: 


W = w | (H - y)AQ) dy. 


12 In many problems involving motion of particles, we need the concept of 
kinetic energy, or energy due to motion. This problem requires us to study and 
learn a method by which we can use calculus to derive an important formula. 
We suppose that, at time t = 0, a particle of mass m starts from rest, with kinetic 
energy zero, at the origin of an x axis and is pulled in the direction of the positive 
x axis by a force F of constant magnitude for which F = Ci at all times. We 
suppose that no force other than F operates on the particle. Letting x denote 
the coordinate of the particle at time t, we use the Newton law F = ma to obtain 
the vector equation 


2 
(1) m Ai = ma = F = Ch 
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From this we conclude that there must be a constant vector Cı such that 


dx 
(2) m Fi = Ci +c 


But (dx/dt)i is the velocity v at time ż, and putting t = 0 in (2) shows that 
cı = 0. Therefore, 


(3) my =m% i= Chi 

From this we conclude that there is a constant vector Cz such that 
(4) mxi = Cri + Co. 

But x = 0 whent = 0, soc, = 0. Therefore, 

(5) mxi = Cri. 


The kinetic energy KE of our particle at time ż is defined to be the amount of 
work done by the force F in bringing the particle from its state of rest at time 
t = Q to its state of motion at time ż£. Since |F| has the constant magnitude C 
and has the direction of motion of the particle as the particle moves the distance 


x, the amount of work done is Cx. Thus KE = Cx and, with the aid of (5) and 
(3), we find that 


(6) KE = Cx = -L (C)? = $mlv]2. 
2m 

Therefore, 

(7) KE = ġm]vl?. 


The next problem requires that the same result be worked out by a different 
method without the assumption that F is a constant. 

13 A particle P of mass m is moved around in E; by a continuous net force F 
which operates over a time interval a $1 < b. The Newton law F = ma then 
shows that the displacement vector r (which is OP), 
the velocity vector v, and the acceleration vector a 
are continuous functions of t. Make a partition of 
the interval a S ż < b and look at Figure 4.694 which 
shows, among other things, the positions of P at times 
ty, and t. Tell why the scalars 


(1) Felel) —r(te1)] and  E(zr) v(t) Atr 


Figure 4.694 

should, when |P| is small, both be good approximations 
to the work done by F in forcing P from P,_; to P} Tell why it should be 
reasonable to adopt either one of the formulas 


n 


(2) = jim x F()-[r(i) — (4-1) 


(3) W = lim 5 F (tr) V(t) Aty 
IP|>0 kal 
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as the definition of the work done by F over the time interval a < ż < b. Show 
that (3) is equivalent to the definition 


(4) W = f F(2)-v(2) dt. 


It remains for us to learn a little trick by means of which information can be 
gleaned from this formula. Using the Newton formula F = ma gives 


(5) F()-v(t) = ma(t)-v(t) = mv’(t)-v(2) 
l d 
= zrg OvO = 5 mF rO 


Hence 

1 d b 
(6) W = 7” f: a [ou dt = > m [mor] 
and 
(7) = ; m|v(b)|? — 5 mlv(a)|2 


In case v(a) = 0, our work gives another derivation of the formula for the kinetic 
energy of a particle of mass m having speed |v(b)|. 
14 The graph of the equation 
ax 
a + 5?’ 


which usually appears in the disguised form x*y + 5% — a% = 0, is called a 
serpentine. Find the area (finite or infinite) of the region in the first quadrant 
between the serpentine and its asymptote. 
15 Accumulation of familiarity with Riemann sums may bring a desire to 
b 
know why Í. f(x) dx cannot exist as a Riemann integral when f is defined but 


unbounded over the intervala S x < b. Ifthe integral exists and has the value 
I, then there must be a partition P of the interval a S x < b such that 


(1) | fet) An, - I) <1 
k=] 
whenever x,_1 £ xp <x, for each k. Show that if (1) holds, then 
2) EDI < (An) +14 Ý lye An | 
k=2 


when xo £ xf < xı. This shows that f must be bounded over the first subinterval 
of the partition P. Similar arguments show that f must be bounded over the 
other subintervals and hence also over the whole interval a Sx S b. 


4.7 Mass, linear density, and moments This section involves 
some ideas that turn out to be important in many ways. Let F be a 
function which is defined over some finite interval a S x S b and is 
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monotone increasing over the interval. This means that F(x:) < F(x.) 
whenever a < xı S x. S b. Such functions F arise in many ways. We 
can, for example, let P (a number) denote the population of an island, 
state, or country, let P(x) denote the number of persons having age less 


than or equal to x, and let 


(4.71) F(x) = Pe). 


We can also suppose that the interval a S x < b represents a line segment 
or a slim beam, as in Figure 4.72, 

Send upon which sand and perhaps other 
— © things are piled and from which hams 
e 4 t z= ł è and other things are hung, and let 

? 

Figure 4.72 F(x) be the total mass which rests 
upon or hangs from the part of the 

interval from a tox. Because of the vividness of the latter interpretation, 
F is sometimes called a mass function even when F(x) is a number which 
is important in social sciences and which has nothing whatever to do with 


such things as pounds and tons and grams and slugs. When x and 
x + Ax both lie in the interval from a to b, the difference quotient 


F(x + Ax) — F(x) 


(4.73) Ax 


represents the average mass per unit length or the average linear density 
over the interval with end points atx and x + Ax. In case this quotient 
has, for a given x, a limit as Ax approaches zero, this limit is called the 
density at x. When this density exists, we call it f(x) so that, by our 
definition of derivatives, f(x) = F’(x). This idea of density has its 
simplest applications in cases where F(x) has a continuous derivative. 
In these cases the function f having values f(x) is called the density 
function of the mass function F, and f(x) = F’(x) for each x. 

We are now ready to start introducing moments. Without assuming 
that F is differentiable or even continuous, let £(xi) be a number (or point) 
not necessarily in the interval from a to b, and let p be an integer which 
is either O or positive. Let P be a partition of the interval a S x S bas 
shown in Figure 4.74. The number F(x) ~ F(xx_1) is the number 

. obtained by starting with the total mass in the interval a S x < x, and 


—_ 
= 


Figure 4.74 


Xp =b 
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subtracting the total mass in the interval a S x S xı. Thus it is the 
total mass in the interval x1 < x S x,. The number 


(4.75) (xf — EPEa) — F(xe-1)] 


represents the pth moment about the point ¢ of a single particle of mass 
F(x) — F(xx-1) concentrated at the point «* and, when the norm of P 
is small, this should be a good approximation to the pth moment about £ 
of the total mass in the interval xı < x S x,. Moreover, the sum 


(4.76) Š (xf — &)?[F (xn) — F(xn-1)] 


should be a good approximation to the pth moment about ¿£ of the total 
mass in the interval a S x Sb. Our statement about (4.76) was neces- 
sarily vague and optimistic because the quantity we are trying to calculate 
has not yet been defined. It is a fundamental fact, which is proved in 
the theory of Riemann-Stieltjes integrals, that there is a number M®?, 
such that the sum in (4.76) is near it whenever |P| is small, that is, 


(4.77) M2, = dim X) (xf — EPEC) — Flr). 

This number M{ is called the pth moment about the point ¢ of the mass 
in the interval a S x S b. Incase p = 0, the pth moment is the total 
mass in the interval aS x < b. In mechanics, the second moment is 
called moment of inertia. In statistics and elsewhere, the particular 
number ž for which MS. = 0 is called the mean (or mean value) of F over 
the interval a S x S b. In mechanics and elsewhere, the point having 
coordinate ¥ is called the centroid of the mass. The number MË- the 
second moment about the centroid or mean, is particularly important in 
mechanics and statistics. 

The above discussion applies equally well to mass functions F that 
possess continuous density functions f and to those that do not. When 
F does possess a continuous density function f, we can solve problems with 
the aid of only Riemann integrals. In the latter case the number 


(4.78) f(x) Ax 


is taken to be an approximation to the total mass in the interval 
xe-1 <x SS x,y, and instead of (4.76) we use the Riemann sum 


(4.781) y (xE — EPS (xk) Ame 
k=l 


as an approximation to M®,. Taking limits as the norm of the partition 
P approaches 0 then gives the formula 


(4.782) M®, = J; (we — £)°f(x) dx. 
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Problems 4.79 


1 As suggested by Figure 4.791, let a rod having constant linear density 
(mass per unit length) ô be supposed to be concentrated on the intervala Sx < b 
of the x axis. Starting by making a partition of the interval a $ x S b, 

calculate MP, the pth moment about E of 


re ——— the rod. Ans.: 


O g a b 7 
ô 
Figure 4.791 MP; = oH [(b — 7+ — (a — £)rty, 


2 Using the result of the preceding problem, prove that M De = Q0 if and 


only if £ = $(a + b). 
3 Supposing that 


J| iàd = M>0, 


show that the constant * satisfies the equation 


h (£ — #f(x) dx = 0 
if and only if 


Mx = A xf(x) dx. 


Remark: Always remember that, in statistics and elsewhere, # is called the mean 
(or mean value) of f over the interval a S x S b and that, in mechanics and 
elsewhere, ¥ is the x coordinate of a centroid. Remember (or learn) that a 
centroid is, as it should be, a point “like a center.” 

4 Supposing that 


b 
Í, flx) dx =M>0 
and that the mean (or x coordinate of the centroid) is %, prove that 
M2, = MË; + (#— Ẹ)*M. 


State the meaning of this formula in words, and use the formula to determine the 
value of £ for which MÊ; has the least possible value. Hint: Start by writing 


M2. = f, E- DYE dx = fe — + E- DIE) ae. 


5 Let f be the function for which f(x) = 0 when x < 0 and f(x) =e? 


when x > 0. Determine and graph the mass function F of which f is the density 
function. 


6 The density function f defined by the first of the formulas 


fe = Se E, Fo ee Ef” St 
cs) = 203? , = 262 
Vimo * vee G 


has the mass function (or cumulative function) F defined by the second formula. 
With the aid of the formula (4.64) make a preliminary attempt to learn the 
natures of the graphs of y = f(x) and y = F(x) when M = 0 ando = 0.01. 
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7 When M = 7, the graphs of the functions in the preceding problem are 
different from the graphs obtained when M =0. What is the difference? 

8 When a particle having mass m rotates in a circular path with angular 
speed w radians per second at a constant distance r from an axis of rotation, its 
speed is 7% and its kinetic energy is $mr’w?. With the aid of this information, 
calculate the kinetic energy of a circular disk of radius a which has mass ô per 
unit area and which is rotating with speed w radians per second about an axis 
through its center perpendicular to its plane. Hint: Base the solution on esti- 
mates of the area of a ring, the mass of the ring, and then the kinetic energy of 
thering. Ans.: KE = frôatw?. The answer has the form KE = 4Jw?, where J, 
the polar moment of inertia of the disk about the axis used, is 4rôat. 

9 The cone of Figure 4.51 has mass 6 per unit volume and is rotating w 
radians per second about its axis. Find its kinetic energy. Hint: Use the answer 
of Problem 8. 

10 Figure 4.792 can make us wonder whether we are becoming wise enough 
to determine the attractive force F upon a particle of mass m at P(x,y,z) that is 
produced by a bar or rod concentrated upon an 
interval a S x < b of the x axis of an x, y, z 7 P(£,y,2) 
coordinate system. We suppose that the bar o? 
has linear density 6(x) at the point (x,0,0) and 
that ô(x) is integrable but not necessarily 7 
constant over the interval asxxzsb. The Figure 4.792 
first task is to set up an integral for F. The 
following solution of this problem should be read even by those who can solve 
the problem without aid and assistance, because it fortifies our understanding 
of the process by which integrals are set up. We make a partition ĝ of the inter- 
vala Sx < b, but we call the partition points to, t1, © © © , tp because the num- 
ber x is the x coordinate of P. If the trick helps us, we can consider the x axis 
to be simultaneously an x axis and a ż axis. For eachk=1,2,-°+,nletép 
be chosen such that %_1 S tt < t We then use the number 


(1) ô (tk) At 


as an approximation to the mass of the part of the rod in the intervaly1<t3S 
i, Supposing that this mass is concentrated at the point P;(z,,0,0), we use the 
number 
Slt) At 
(2) m ee) An 

|PP,? 


as an approximation to the magnitude of the force AF, on the particle at P pro- 
duced by the part of the rod in the interval %1< 14 £ % Problem 19 of Prob- 
lems 2.39 discusses this matter and shows how we derive the formula 


5(t*) Ats PP, 


(3) AF; = Gm a 
|PP;I? 


by use of the fact that a nonzero vector is the product of its magnitude and a 
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unit vector in its direction. Our next step is to write PP, in terms of the coord}. 
nates of P and P, and to write 


(te — x)i — yj — zk 


(4) X AF, =GmÈ 5 (tz) (te _ x)? + y2 + z?]% 


Atg. 


Everything is now prepared for the crucial steps. When the norm |Ọ| of the 
partition Q is small, the sums in (4) should be good approximations for the force 
F that we are trying to define. In other words, F should be the limit of these 
sums. But these sums are Riemann sums and, provided P(x,y,z) is not a point 
on the interval a S x <S b of the x axis, they have a limit which is the Riemann 
integral in the formula 


b (¢ —x)i — yj — zk 
6) P= 6m f, O a A 
Our work motivates the definition whereby F is defined by (5). While (5) serves 
as a source of information about F in other cases, we confine our attention here 
to the case in which the density is a constant, say 6(¢) = ôo for each #, and, more- 
over, y =z=OQOandx<a<b. In this case, F has the direction of i, and if 
we denote its magnitude by F;(a,),x), then 


b 
(6) Fy(a,b,x) = Gmdq Í, (t — x)? di 
1 
= Gmò (72 ~ 55) 
It is easy to see that 
(7) lim Fy(a,b,x) = C”, im Fy(ajb,x) = œ. 
b> a a— x% t—a— 


If these formulas agree with our intuitive notions, then at least some of our 
intuitive notions are good. The second result in (7) gives us a lesson in approxi- 
mation. Since particles near ends of actual steel rods are not subject to huge 
attractive forces, we must conclude that very bad approximations to forces on 
these particles are obtained from calculations based on assumptions that the 
rods are concentrated on their axes. 

11 Modify Figure 4.792 to fit the case in which h>0, a = —h, b= h, 


d(x) = do, x = 0, and z = 0. Show that, in this case, formula (5) of the pre- 
ceding problem becomes 


A ti — yj 
F = Gms -a Œ + y9% di. 


After having a good look at the coefficients of i and j, show that 
F 2Gmô tol 
= — Lom oyj IA (2? + y?) 4 dt. 


12 A thin cylindrical shell S of radius R has its axis on the x axis of an x, 4, 
z coordinate system and has its ends in the planes having the equations x = a 
and x = b. This shell has constant areal density (mass per unit area) 6. Find 
the gravitational force F which it exerts upon a particle m* of mass m which is 
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concentrated at the point (c,0,0). Hint: 
As suggested by Figure 4.793, make a 
partition P of the interval a Sx Sb. 
Consider the part of the shell between 
the planes having the equations x = x,_} 
and x = x, to be a circular ring having 
its mass M, concentrated i in the plane Figure 4.793 
having the equation x = x. Let AF, 

be the force exerted upon m* by this ring. Because of symmetry, the components 
of AF; orthogonal to i are zero. Moreover, the i component of AF, (which is 
AF,) is the same as the i component of the force on m* produced by a single 
particle of mass M;, concentrated at the point (xz,R, 0) in £3. Therefore, 


[(x* — c)? + a 


and we are ready to calculate Mp and get on with the calculus. Ans.: 


1 1 , 
(2) F = 2rôGmR ( Se STE =) i. 

13 We can claim that if the density ô and the radius R of the cylindrical shell 
of Problem 12 are so related that the total mass is M, then the answer to Problem 
12 should be nearly the same as one of the answers to Problem 10 when R is 
near zero. Isitso? Ans.: Yes, unless misprints disrupt the harmony. 

14 A circular disk of radius H has its center on the x axis of an x, y, z coordi- 
nate system and lies in the plane having the equation x = xo. This disk has 
constant areal density (mass per unit area) 6. Set up an integral for the gravita- 
tional force F which the disk exerts upon a particle m* of mass m which is con- 
centrated at the point (c,0,0) when c # xo Hint: Make a partition with the 
aid of which the disk is split into a collection of concentric rings so that a repre- 
sentative ring has radius ry. The hint of Problem 12 provides a formula that 
can be adapted to give the force which the representative ring exerts upon m*. 


Ans.: 


(1) AF, = Gm 


(1) F = 21 8Gm(x — c)i f° CERE tae 
1 
(2) F = 296Gm(xo — c) lea — TES i. 


When M is the total mass of the disk so that M = mH?2ô, the answer can be put 
in the form 


GmMTx—¢ xo — c . 
3 F =2——/[ 2 L aÃ 
(3) 2 H? Fer H? + = |} 


Remark: We really should look at these formulas. For example, (2) gives very 
interesting information when H is large and our disk is a huge part of a homo- 
geneous plane. One who wishes additional mental elevation should undertake 
to realize that we can replace gravitational laws and constants by electrostatic 
ones and obtain information about forces on electrons produced by charges on 
plates of capacitors. 


266 Integrals 


15 A cylindrical solid of radius R has its axis on the x axis of an x, y, z coordi- 
nate system and has its ends in the planes having the equations x = a and x = b, 
This solid has uniform density (mass per unit volume) 6. Find the gravitational 
force F which this solid exerts upon a particle m* of mass m located at the point 
(c,0,0) of £3, it being assumed that c < a. Hint: Make a partition of the interya] 
a<x2<b. Consider the part of the cylinder between the planes having equa- 
tions x = x,_-1 and x = x; to be a circular disk in the plane having the equation 
x = xf. Let AF, be the force exerted upon the particle m* by this disk. A 
formula of Problem 14 can then be applied. Ans.: 


F = nGmsi f È _ STE! ax 
F = 2rGmôļlb — a — (V(b — oc)? + R? — Vla — 0)? + RIL 


This can be put in the form 


nM, _ VO=EB-VE=FER), 
R? g b-—a > 


F=2 


where M = rR?(b — a)ô, the total mass of the cylindrical solid. 

16 Let S be a thin spherical shell which is assumed to be concentrated on a 
sphere (surface, not ball) of radius a having its center at the origin. The shell 
has constant areal density (mass per unit area) 6. Let m* be a particle of mass m 
which is concentrated at a point (—b,0,0) which lies at the origin or at distance 
b from the origin on the negative x axis. Thus b 2 0, and we suppose that 
b = a so m* does not lie on the sphere. The gravitational force F exerted upon 
m* by the shell depends upon the location of m*. If0 <b < aso that m* is 
inside the sphere, then F = 0. If b > a so that m* is outside the sphere, then 


(1) F=6™¥i, 


where M is the total mass of the shell. Thus when m* lies outside the shell, 
the force on it exerted by the shell is the same as the force exerted on it by a 
particle at the center of the shell whose mass is the total mass of the shell. From 
our present point of view, proofs of these famous and important results (which are 
discussed in more general terms in Section 13.8) can be comprehended more 
easily than they can be originated. To start our proof, we slice the spherical 
shell into ribbons to which we can apply a basic result given in Problem 12. 

The spherical shell is obtained by 


aA, rotating the semicircle of Figure 4.794 
about the x axis. We make a partition 
m* P of the interval0 < 0 <r. With the 
(-b0,0 (-a,0,0) O (aon) Zz aid of the basic formula 
Figure 4.794 (2) Angle = length of arc 


radius 


we see that the lines making angle 6,.; and 6, with the positive x axis have 
between them an arc of the circle of length a(6; — 6,-1), or a Ay. When this 
arc is rotated about the x axis, it produces a part of the spherical shell which can 
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þe described roughly as a circular ribbon having radius a sin 6;, width a Abr, 
length 2r a sin 6, area 2r a? sin Or Afk, and mass Mp, where 


(3) M, = 2rôa? sin 07 Abr. 


Considering this ribbon to be a circular ring of mass M, and radius a sin 0* which 
has its center on the x axis and which lies in the plane having the equation x = 
acos 07, we use formula (1) of Problem 12 with ¢ = —b to obtain the formula 


(b + a cos 07) sin OF 


AF, = 2rGmôĝa’i ——————— >} 
(4) " [(b + a cos 0r)? + (a sin 6,)7]%% 


Ab; 
for an approximation to the force upon m* produced by one element of the 
spherical shell. The limit of the sum of these things should be the force F that 


we are seeking. But the sum is a Riemann sum and its limit is a Riemann 
integral. This leads us to the formula 


(5) F = 2rGmôaiU, 
where U is the unruly integral defined by 


x (b+ acos ĝ)a sin 0 
o [b? + 2ab cos 6 + a*]% 


The hypothesis that b 2 0 and b Æ a implies that the denominators in (4) and 
(6) are never zero and hence that the integrand in (6) is continuous. Before 
making a serious attack on the integral, we can observe that it is certainly positive 
when b > a and that it is 0 when b = 0. To simplify the integral, we make the 
substitution (or change of variable) 


(6) U = dð. 


(5) a cos ĝ = x, 
so that —a sin 0 d0 = dx. Since x = a when 0 = 0 and x = —a when 6 = 7, 
rules which have not yet been adequately treated imply that 

o= ā bł+x 
(6) U = f: _a DEF 2be + ays A 
To simplify the integral some more when b =Æ 0, we make the substitution 

> t— b — a 
b? + 2bx + a = 1, x= 75 


Since dx = (1/2b) dt, t = (b — a)? when x = —a, andi = (b + a)? when x = a. 
substitution in (6) gives 
t— b? — a? 


1 (b-+a)? b + 2b 


(7) U = 7% (ba)? —~ gy te 
Thus 
1 (b+a)? 

= —b6 — 72\¢7% 
(8) U P Jao: [3 + (b? — a)i] di, 
9 1 , (b-+a)? 

= —— 16 — 2 — g2 —} | 
(9) 7 E (b? — at bw! 


a b-a 


(10) U = 55 | lb + al - b-a- a - Poa | 
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In case 0 < b < a, this gives 


1 2 b2? — a? 


and (5) shows that F = 0 as we wish to prove. In case 0 < a < b, (10) gives 


1 b2?— a? | b? — a? 2a 
(12) U=m[6+9-6-9-Fg54+ 52a] 


Putting this in (5) gives 


Gm(4na?6)i_ GmM 
B= = a i 


where M = 47ra?6, the total mass of the spherical shell. This is the desired result 
(1) and the fundamental facts about attractions of spherical shells are now 
established. 

17 Use the method of Problem 16 but modify the details in appropriate 
places to obtain the force Fg exerted upon m* by the hemispherical shell that 
remains after removal of the part of the spherical shell whose points have nega- 
tive x coordinates. 

18 A spherical ball (or solid sphere) is said to be radially homogeneous if 
there is a function 5 such that the ball has density (mass per unit volume) 6(r) 
at each point having distance r from the center of the sphere. Supposing that 
O < a <b, find the gravitational force exerted upon a particle m* of mass m 
located at the point (—8,0,0) in Æ by a radially homogeneous spherical ball 
(like an idealized earth or golf ball) B which has radius a, which has its center at 

the origin, and which has a density function 6 
which is not necessarily constant but is inte- 
grable. Solution: As suggested by Figure 


G— 4.795, we make a partition P of the interval 
-b -a O xf a x QOsSxa2a. When x18 XE S x, as usual, 
Figure 4.795 the points of B having distance x% from 0 such 


that xp) < XE <x, form a spherical shell 
whose volume is approximately the product of the area 4axz” and the thickness 
Arg. The mass M, of this shell is therefore approximately 


(1) My = Arx 76 (xp) Axe. 


Considering the shell to be concentrated upon the sphere having center at 0 
and having radius xj enables us to use a result of Problem 16 to show that the 
force AF; which the shell exerts upon m* is approximately 


_ Gml4mxy (xn) Axi] ; 


(2) AF, 52 


The limit of the sum of these things should be F. But the sum is a Riemann 
sum and its limit is the integral in the formula 


(3) = ri 
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This answer can be greatly improved if we notice that methods very similar to 
those which we have used enable us to show that the integral in (3) is the total 
mass M of the ball B. Thus we can put (3) in the form 


GmM . 
(4) F = -pr i 


This proves that the force exerted upon m* by a radially homogeneous ball is 
the same as the force exerted upon m* by a single particle, at the center of the 
ball, whose mass is the total mass of the ball. Thus, when computing forces 
upon particles outside the ball, we may “consider the mass of the ball to be con- 
centrated at its center,”’ the assertion in quotation marks being rather weird 
because mass is a number and we do not ordinarily squeeze numbers. 

19 Use the method of Problem 18 to show that if S is a radially homogeneous 
spherical shell having inner and outer radii zı and rz for which rı < r2 then 
F = 0 when F is the gravitational force which the shell exerts upon a particle 
inside the shell. 

20 With the aid of arguments involving continuity, the final formulas of 
preceding problems for gravitational forces upon particles exerted by solid 
spherical balls and solid unconcentrated spherical shells can be proved to be 
correct when the particles lie on boundaries of the balls and shells. Using this 
fact, show how it is possible to split a given radially homogeneous solid ball into 
an inner solid ball and an outer spherical shell to calculate the force which the 
given ball exerts upon a particle m* of mass m concentrated at an inner point 
of the given ball. 


4.8 Moments and centroids in E,and E; Section 4.7 introduced us 
to moments, about a point on a line, of material concentrated upon the 
line. This section introduces us to two similar ideas. In the first place, 
we consider moments, about a line, of material concentrated in a plane 
containing that line. In the second place we consider moments, about a 
plane, of material in £}. 

To begin, let R be a bounded region in the xy plane which lies between 
the lines having the equation x = aand x = b. Itis supposed that when 
a<x* <b, the line having the equation x = x* intersects R in an 
interval (or collection of intervals) having length (or total length) f(x*). 
It is not necessary that f be continuous, but we do assume that K has 


area |R| and that |R| = f f(x) dx. If the region R is, as in Figure 4.81, 


Figure 4.81 
y=) 
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the set of points (x,y) for which a S x <S b and fi(x) S y S f(x), then 
everything is quite simple and f(x) = fo(x) — fi(x). Our next step is to 
suppose that the points (x,y) of R are points of a material body (much like 
a sheet of paper or a sheet of copper) that has been compressed into a 
plane in such a way that, for some positive constant ô, a part of the com- 
pressed material body has mass 6 AR if that part occupies a part of the 
region R having area AR. The compressed body is called a lamina, and 
it is a homogeneous lamina because the areal density (mass per unit area) 
has the same constant value 6 at all places in the lamina. 

Letting p be an integer which is either 0 or positive, we proceed to define 
the number MP, the pth moment of the lamina about the line having 
the equation x = ¢, by a formula from which it can be calculated. Fol- 
lowing the method of Section 4.7, we make a partition P of the interval 
a <x <S binto subintervals. For each è, the lines having the equations 
x = xk- and x = x, have between them a part of the lamina that can be 
called a strip parallel to the line having the equation x = & Supposing 
as usual that x.) S xf S xz, we use the number 


(4.82) f (xi) Axy 


as an approximation to the area of the strip and accordingly use the 
number 


(4.821) Sf(at) Axr 


as an approximation to the mass of the strip. When the norm of P is 
small, all points of the strip lie at about the same distance |x* — é from 
the line having the equation x = £, and multiplying the above mass by 
(xf — £)? should therefore give a good approximation to the pth moment 
of the strip about the line having the equation x = & The Riemann sum 


(4.822) SE (at — £)?f(x#) Axe 


should then be a good approximation to the moment of the whole lamina. 
Since the Riemann sums have a limit which is the Riemann integral in 
the right member of the formula 


(4.83) M2, = 8 [? (æ — 8)°f(x) de, 


our work motivates the definition by which the required moment is 
defined by this formula. 


The number M? the pth moment of the lamina about the line 
having the equation y = y, is defined by the analogous formula 


(4.831) Me, = è ["(y — n?) ay, 


where c and d are numbers such that the lamina lies between the lines 
having the equation y = c and y = d and g(y*) is the length of the inter- 
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yal (or the sum of the lengths of the intervals) in which the line having the 
equation y = y* intersects the lamina. In case p = 0, the pth moment 
is the mass of the lamina. In mechanics and some other places, the sec- 
ond moment is called the moment of inertia. 

While the facts can be established only by considering the different 
rectangular coordinate systems in the plane of the lamina, the lamina 
itself determines a point in the plane of the lamina that is called the 
centroid of the lamina. With reference to the particular coordinate 
system which we have chosen, the x coordinate of this centroid is the 


number * for which M; = 0 when ¢ = & Thus 


(4.84) ô f! (x — fle) de = 0 

and it follows that 

3 f: xf(x) dx 
a ME dx 


where M, the denominator in the second formula, is the mass of the 
lamina. Similar formulas suffice to determine the y coordinate 9 of the 
centroid. For example, 


(4.842) My = 5 | ye(y) dy. 


(4.841) Mz = č Í f(a) de, z 


The centroid of a lamina has an important physical property. If the 
lamina is in a plane perpendicular to the direction of the forces in a 
uniform parallel force field, then the lamina will balance upon each line 
(or knife-edge) which passes through the centroid and will balance upon a 
pin placed at the centroid. It follows that if L is a line of symmetry of a 
lamina, then the centroid lies on L. Moreover, if a point P is a center of 
symmetry of a lamina, then the centroid is P. 

We now turn our attention to the three-dimensional world which con- 
tains, in addition to cubes and spherical balls, so many distractions that 
relatively few of its inhabitants assimilate substantial information about 
nonmeasurable sets in Æ;. To keep these complicated and paradoxical 
sets out of our gardens, we shouldt (and therefore do) start with a set 
S in Æ; which is assumed to possess positive volume VY. In order to be 
able to use Riemann integrals, we assume that wherever we introduce an 
x, y, z coordinate system in Es, there will be numbers a and b for which our 
set S lies between the planes having the equations x = a and x = b. 
We assume that, for each ż for which a <£ t <£ b, the plane having the 
equation x = t intersects S in a section having area which we denote by 
A(t). In many applications this area function is continuous. To be 


} This is another situation in which we can be kept on the path of rectitude by knowledge 
of the contents of Appendix 2 at the end of this book. 
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fully rigorous about the matter, we assume that the Riemann integral in 
(4.85) V = f: A(x) dx 


exists and is the volume V of the set S. Our next step is to suppose that 
the points (x,y,z) of the set S are points of a material body B such that, 
for some positive constant 6, a part of the body has mass 6 AV if that part 
occupies a part of the set S having volume AY. Our body B (which 
really is somewhat different from the conglomeration of atomic particles 
that constitute a potato) is said to be homogeneous because its density 
(mass per unit volume) has the same constant value 6 at all places in the 
body. At last we have a body B which might, for example, be what a 
child thinks a potato is. 

4 Supposing that ¢ is a number and 
that p is O or a positive integer, we 
should now find it easy to construct 
formulas for calculation of the num- 
ber Mi, the pth moment of the body 
B about the plane having the equation 
Figure 4.86 l x = & Realizing that schematic 

figures can be helpful even when 
some wise people consider them to be semisuperfluous, we sketch Figure 
4.86. We make a partition P of the intervala < x < b into subintervals. 
Supposing as usual that xr-ı S x* S xk, we use the number 


(4.87) A (xe) Axx 


as an approximation to the volume of the slab which lies between the 
planes having the equations x = x,_; and x = x;. Multiplying by the 
density 6 gives an approximation to the mass of the slab. When the 
norm of P is small, all points of the slab lie at about the same distance 
|x — | from the plane having the equation x = £ and multiplying the 
mass by (xf — £)” should therefore give a good approximation to the pth 
moment of the slab about the plane having the equation x = & The 
Riemann sum 


(4.871) bE (x* — £)PA(x#) Axr 


should then be a good approximation to the moment of the whole body. 
Since the Riemann sums have a limit which is the Riemann integral in 
the right member of the formula 


z x= 


(4.872) M2; = 5 f (x — A(x) de, 


our work motivates the definition by which the required moment is 
defined by this formula. Analogous formulas define moments about 
planes parallel to the other coordinate planes. 
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In case p = Q, the pth moment is the mass of the body. We shall not 
comment upon second moments of solid bodies about planes, but brief 
comments about first moments may be appropriate. As was the case for 
laminas, our body B determines a point in E; which is called the centroid 
of the body. With reference to the particular coordinate system which 
we have chosen, the x coordinate of the centroid is the number # for which 


M2, = 0 when & = 7. Thus 


b 
(4.88) ô J, (Œ — DA) dx = 0 
and it follows that 


b ô ih xA(x) dx 
(4.881) Ma = ô | xA(x) dx, # = L, 
a ô Í A(x) dx 


where M, the denominator in the second formula, is the mass of the body 
B. Analogous formulas serve to determine the other coordinates ¥ and Z 
of the centroid. As was the case for centroids of laminas, the centroid 
of a body B in £; has an important physical property. An ordinary 
wheel mounted on an axle through its center balances in the gravitational 
field of the earth which is (so far as an ordinary wheel near the surface is 
concerned) nearly a uniform parallel force field. Similarly, the body B, 
when mounted on an axis through its centroid, must balance in a uniform 
parallel force field. Ifa plane r is a plane of symmetry of the body B, 
then the centroid of B is a point inr. Ifa line L is a line of symmetry of 
B, then the centroid of B is a point on L. Ifa point P is a center of sym- 
metry of B, then the centroid of B is P. 

All through this section, the moments that have appeared have been 
“moments of mass,” that is, moments of lamina or solid bodies that possess 
mass. Our methods and formulas are easily modified to produce numbers 
that are “moments of area,” that is, moments of sets in He that possess 
positive area, and “‘moments of volumes,” that is, moments of sets in £; 
that possess positive volumes. The moments and the centroid of a set 
S in E which possesses positive area are, by definition, the same as those 
of the lamina of unit areal density (unit mass per unit area) which coin- 
cides with the set. Similarly, the moments and the centroid of a set S 
in Æ; possessing positive volume are defined to be the moments and the 
centroid of a body of unit density (unit mass per unit volume) which 
coincides with S. Thus formulas for moment and centroids of “geo- 
metrical” sets are obtained by putting ô = 1 in formulas for “moments 
and centroids of mass.” These concepts are introduced because they 
are useful. For example, calculations involving forces which bend a beam 
depend upon a number J which is the moment of inertia of a cross section 
of the beam about a line through the centroid of the cross section. 
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Problems 4.89 


1 Find the pth moment about the line having the equation x = & of the 
lamina of constant areal density (mass per unit area) ô which occupies the plane 
region consisting of points (x,y) for which 


aSxSb, OSySh Ans: TH = BP — (a ~ BPH, 


2 <A semicircular disk of radius a has its center at the origin and lies in the 
right half-plane containing points (x,y) for which x 20. Find its centroid. 


__ 4a _ 
Ans.: % = 37> 5 =0. 


3 A homogeneous spherical ball of radius @ has its center at the origin. 
Find the centroid of the hemispherical part of the ball containing points for 
which x 20. Ans.: # = $a,5 =0,7=0. 

4 Prove that the centroid of a right circular conical solid of height A has dis- 
tance h/4 from the base of the solid. 

5 Find the pth moment about the y axis of the region bounded by the x 
axis, the line having the equation x = 1, and the graph of the equation y = x", 
it being supposed that 7 is a nonnegative constant. Ans.:1/(p +r + 1). 

6 Copy Figure 1.292 and let T be the triangular region bounded by the 
triangle having vertices at 4, B, C. Set up and evaluate all of the integrals 
required for evaluation of |T], the area of T, M2), the first moment of T 
about the y axis, and M‘,, the first moment of T about the x axis. Then 
use the formulas |T| = M®, and |T| = MẸ}, to find # and 5. Remark: 
The point (%,9) is the point (24/3, 0). This shows that the centroid of T lies on 
the median 4D. More remarks can be made. 

7 This problem involves hydrostatic forces which liquids exert upon surfaces 
of bodies immersed in them. Before formulating our problem, we digress to eke 
out some information. If an ordinary rectangular or cylindrical tank has hori- 
zontal sections having area 4 square feet and if the tank is filled to depth d feet 
with a liquid weighing w pounds per cubic foot, then the total weight of the con- 
tents of the tank is wdA. If we divide this total weight wd by the area 4 of 
the base of the tank, we obtain the number wd, which is the weight per square 
foot that the base supports. This number wd, the product of w and the depth, 
is called the pressure at depth d. This pressure wd is a scalar, the magnitude of 
the force per unit area which the liquid exerts upon the flat horizontal base of 
the tank. Our next task is to capture the idea that the jumble of words “pressure 
in a gas and in a liquid is transmitted (sent across?) equally in all directions” 
is often presumed to convey. To be very humble about this matter, we can 
believe or perhaps even know that water will spurt from a hole in the bottom of a 
tank of water and will spurt almost as vigorously from a hole near the bottom 
of the tank but in a vertical side of the tank. Fortified by this idea, we can 
cheerfully accept the ponderous physical principle or law which says that if a 
plane region having area A is beneath the surface of a liquid, and if dı and dz 
are numbers such that each point of the region has a depth d for which dı S d S 
d+, then the force which the liquid exerts upon one side of this region is orthogonal 
or normal or perpendicular to the region and there is a number d* such that 
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dı £ d* < dand the magnitude of the force is p*4, where p* is wd*, the pressure 
at depth d*. Our little lesson in hydrostatics is ended, and we can now formulate 
our problem. We confine our attention to forces upon one side of a plane region R 
which lies, as in Figure 4.891, beneath the sur- 
face of a liquid and ina vertical plane. The x 
axis is taken to be horizontal and in the top 
surface of the liquid. The y axis is taken to at-~-—-—--—-=s 
be vertical with y positive measured down- 
ward; this means that the point (x,y) lies y 
below the x axis when y > 0. It is supposed 
that the region R lies between the lines having 

the equations y = a and y = b and that, when 
a<y* <b, the line having the equation Figure 4.891 
y = y* intersects R in an interval (or collec- 

tion of intervals) having length (or total length) f(y*). It is assumed that the 
region R and the function f are bounded. It is not necessary that f be con- 


tinuous, but we do assume that R has area |R| and that |R| = J. dx. Our 
a 


Top surface of liquid 


problem is to set up an integral for the magnitude of the force which the liquid 
exerts upon one side of the region R. The procedure should now be completely 
familiar. We make a partition of the interval a S y £ b into subintervals and 
choose y% such that yz-1 £ y* S yr The number 


(1) flog) Av 


is taken as an approximation to the area of the part of R that lies in a strip parallel 
to the surface of the liquid. Multiplying this by wy% gives an approximation to 
the magnitude of the force of the part of R. Since the forces on the parts of R 
all have the same direction, the sum 


(2) weyet (ye) Ave 


gives an approximation to the magnitude |F| of the force on the whole region R, 
and the approximation should be good when the norm of the partition is small. 
Thus it should be true that 


(3) IF| = lim w2yzf(yz) Ayr- 


Since the right member of (3) exists and is a Riemann integral, our work motivates 
the definitions where |F| is defined by the formula 


(4) IF| = w f; »f0) ay. 


In order to make numerical calculations, we must know or be able to compute 
a,b, and f(y). The really interesting thing about our result is that it can be put 
in the form 


(5) E| = wy, 


where 4 is the area of the region R and § is the depth of its centroid. Thus the 
magnitude of the force on the plane region R is the product of the pressure at the cen- 
troid and the area of the region. Many problems can be solved very quickly by 
use of this fact. 
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8 Using the fact that the pressure at depth d below the surface of water is 
wd, but without using more formulas from the preceding problem, find the mag- 
nitude of the force exerted upon one face of an isosceles right triangle submerged 
in water so that one leg is horizontal and 5 feet below the surface while the other 
leg extends 3 feet upward. Ans.: 18w. 

9 According to an examination given at Cornell, the cost, in dollars per mile, 
of improving the road from Alibab to the Babila oil field 400 miles down the road 
is 10,000 plus 500 V x, where x is the distance from Alibab. Find the total cost 
of the improvement. Ans.: 6 + $ millions. 

10 A circle of radius a lies in the xy plane and has its center at the origin. 
For each positive integer n, points Po, Pn, © © * , Pa are equally spaced on the 
arc of the circle lying in the first quadrant and, for each & for which 1 S k J n, 
a vector r; is drawn from the origin to a point on the circle between Py_; and P}. 
Show that 
(1) lim Ett ii tre igj), 

n— æo n T 
where, as usual, i and j are unit vectors on the x and y axes. Hint: Make use of 
the fact that if 


_ kt -T 
(2) 6; = on AG, = an 
then 
(3) Tk 2a (cos Ogi + sin O¢j) AO;, 
n T 


where 6; lies between 6,-1 and 6,. The left member of (1) is therefore the limit 
of a Riemann sum. 

11 One way to review Riemann integrals and make them seem simpler is to 
learn about Riemann-Stieltjes integrals. Let f(t) and g(t) be defined over an 
interval a S t S x, and let P bea partition of the interval a S ¢ S x with parti- 
tion points ¢, and intermediate points t% as in Section 4.2. If there is a number J 
such that to each e > 0 there corresponds a 6 > 0 such that 


n 


(1) | È ADe — sl- I| <e 
whenever |P| < 6, then J is called the Riemann-Stieltjes integral of f with respect 
to g over the interval a S t S x and is denoted by 


(2) JÉ IO 280). 


These integrals are very important in more advanced mathematics, and some 
people think that they should at least be mentioned in elementary calculus. 
Many people have devoted substantial parts of their lives to study of problems for 
which f(t) = #. Start picking up ideas by evaluating (2) when a = —1, x = 1, 
f(t) = t£ + 2: + 3, and g(t) = sgn t. Ans.: 6. 


4.9 Simpson and other approximations to integrals When fisa 
polynomial in x, and in some other cases, we can discover an elementary 
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function F for which Jf(x) dx = F(x) or f(x) = F'(x) and can then evaluate 
b eo 
Í f(x) dx by the calculation 


(4.91) Ji f@) de = Fa) |? = F® — FO). 


As was pointed out in Section 3.6, derivatives of elementary functions 
are always elementary functions that can be calculated by use of appro- 
priate rules. It must not be presumed, however, that if f is an elementary 
function, then there must exist an elementary function F for which 
f(x) = F’(x). While proofs of such things do not grow in ordinary 
gardens, it is nevertheless known that if f(x) is one or another of 


l e7 
— A —"» lA 
«/4 — sin? x x > 


then there is no elementary function F for which f(x) = F’(x). 
This section is devoted to methods by which we can obtain useful 
decimal approximations to 


(4.92) fe KA ax 


in cases where it is impossible or difficult to obtain a useful formula for a 
function F such that (4.91) holds. Some pedestrian methods are worthy 
of brief mention. When a reasonably accurate graph of f is drawn on 
graph paper as in Figure 4.93, we can ob- 
tain an informative approximation by 
counting the squares and estimating the 
partial squares that lie within the appro- 
priate region. Chemists and others who 
have access to scissors and appropriate 
scales can cut out the region and weigh the 
paper. Another method involves use of a Figure 4.93 
planimeter, an instrument which will reveal 
a useful approximation to the area of a 
region after it has been suitably adjusted 
and a needle point on a movable arm has 
traced the boundary of the region. In 
some situations, the simplest and most 
direct method is illustrated by Figure 4.931. Figure 4.931 
The interval a S x S b is cut into n equal 

subintervals of length h, where h = (b — a)/n, and a point xj is selected 
in the &th subinterval. Then 


(4.932) f: f(x) dx =e+h È f(x), 


sin x 


VI+, x V1 + sin? x, 


a b x 


where eis an error term and the sum is a particular Riemann sum. Of 
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course, we should try to minimize errors by choosing the heights of the 
rectangles in such a way that, in each strip, the area of the set which lies 
in the region but outside the rectangle is nearly equal to the area of the 
set which lies inside the rectangle but outside the region. 

This paragraph introduces the trapezoidal formula 


b 
(4.94) [sey ae = eta] Btn tot yet vo teat Bh 


the derivation of which will help us to understand the much better 
formula (4.95) which will appear in the next paragraph. We sepa- 
rate the interval a S x < b into n equal subintervals of length h, where 
h = (b — a)/n, by points xo, xı, © © © , Xn Such that xo = a, xn = b, and 
Xk = Xp-1 + h for each k = 1, 2,--+,n. As in Figure 4.943, where 


n = 4, we let ye = f(xx) foreach &. As an approximation to fe T(x) dx 


we use im L(x) dx, where L(x) = Ax + B and the constants are chosen 


such that the graph of L(x) = Ax + B is a line passing through the two 
points Po(xo,yo) and Pi(x1,y1). The details of the calculation 


(4.941) [re dx = f E + e x) | dx = preys 
Zo To 

are easily supplied; in case yo and yı are positive, the details are super- 
fluous because the quantities are equal to the area of a trapezoid and ele- 
mentary geometry shows that the formula is correct. Using (4.941) and 
analogous formulas, we see that 


(4.942) * fla) de = er + h IE 
Tki 

where the “error term” e will be “relatively small” if the graph of f over 
the interval xk-ı S x Sx. 
is “‘near” the chord joining 
P,-, and Pk. Summing 
the members of (4.942) 
gives the trapezoidal for- 
mula (4.94). 

To derive the trape- 
a=Xy xy xz x3 x,=b * zoidal formula (4.94), we 
Figure 4.943 began by approximating 

f(x) over the interval 
xo S£ x S xı by the function L(x) whose graph is a line passing through 
the two points Po and P;. To derive the more useful Simpson formula 


b 
(4.95) f IE de = e +5 lyo + dont 2ye + Aya + yt s. 
_ + 4yn—1 + nl, 
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in which n is always an even positive integer, we begin by approximating 
f(x) over the interval xo S x S x2 by the function Q(x) of the form 


(4.951) O(n) = A(x — x1)? + Bee — x) +C 


whose graph passes through the three points Po(%o,yo), Pi(%1,y1), and 
P.(x2,y2). Since (4.951) can be put in the form Q(x) = 4x? + Bix + Ci, 
its graph is a parabola if 4 # Oandisalineif 4 = 0. Asis easy to guess, 
the graph of Q(x) is ordinarily a much better approximation to the arc 
P,P, than the graph consisting of the two straight chords PoP; and P,P, 
is, and hence the error term in the Simpson formula is ordinarily much 
nearer 0 than the error term in the trapezoidal formula. We find that 


f Q(x) dx = ze +B Gry + C(x — xı) pr 
SO 


(4.952) | ”* Olx) dx = z [24k + 6C]. 


The three formulas 


yo = Q(x) = Ow — h) = 4h? — Bh+C 
yı = Qx) =C 
yo = Q(x) = Or + h) = 4h + BA+C 


enable us to determine 4, B, C in terms of yo, yi, ye. It serves our 
purpose, however, to add the first and last of the formulas to obtain 


yo + yo = 24h? + 2C 
and to note that 4y1 = 4C so 
yo + 4y1 + yo = 24h? + 6C. 
This and (4.952) give the formula 


(4.953) in Q(x) dx = ; [yo + 4y + yə]. 
Using (4.953) and analogous formulas, we see that 


” Ha) dx =e, + 3 [yo + 4y1 + yal, 


= e + 3 [y2 + dys + yd, cee y 


Sh 
om 
R 
~~ 
> 

| 


n h 
f(x) dx = enp + 3 [Yn—2 + 4yn—1 + Yal. 
Adding these gives the Simpson formula 


b 
(4.96) Í f(x) dx = e + : [yo + 4y1 +292 + 4y2 + 2ye to: 
Í + Ayn-ı + Yal, 
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which appears in (4.95) and is so important that it merits reproduction, 
We recall that n must be even and that h = (b — a)/n. Whenever f is 
Riemann integrable over the interval a £ x < b, the error term eis near 
zero when n is large. When f is continuous, and perhaps in some other 
cases as well, experienced operators of pencils and slide rules and calcula- 
tors and electronic computers neglect the e and habitually use the remain- 
ing Simpson sum in the right member of (4.96) as an approximation to the 
integral. A particular sum is often judged to be as accurate as desired 
when this sum agrees to the desired number of decimal places with the 
sum obtained by doubling n. In many practical applications, sur- 
prisingly small values of n yield the desired accuracy. 

Nearly everyone who understands the trapezoidal and Simpson formu- 
las generates the following idea. It should be possible to derive still 
better formulas by approximating f by polynomials of higher degree 
having graphs passing through more of the points Po, Pi, Pe, Ps, © +> 
It turns out, however, that these formulas are more complicated than the 
Simpson formula, and using them for a given Å is not as satisfactory as 
using the Simpson formula with a smaller A. 


Problems 4.99 
1 Tables give 
f ° dx = log 2 = 0.69314 71806. 
1 


Show that the trapezoidal formula with n = 4 gives h = 4, yo = 1, yı = 4, 
ye = $, y: = 7 y4 = $, and 


[ose se+ H+ Ht $+ 44H = e+ 0.69702 4 


x 


and that use of the Simpson formula with n = 4 gives h = 4 and 
foe = e + roll +3 +3 tF + 3] = e + 0.69325 4. 


Show that the error terms are respectively —0.003877 and —0.000107. Observe 
that it is almost equally easy to use the trapezoidal and Simpson formulas. 
Remember that properly educated persons use the Simpson formula whenever 


suitable occasions arise, but that they rarely if ever use the trapezoidal formula. 
2 Tables give 


log 2.5 = 0.91629 07319. 


Using the Simpson formula with two subintervals, obtain the approximation 


251, fl, 4 , 17. 
f z= alat aoe tag | = 0.223148. 


Show how this and the last numerical result of Problem 1 give the approximation 


log 2.5 = 0.91640 2. 
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3 Using the Simpson formula with n = 2, obtain the approximations 


271, _01 
Si e= Flag tet GF] = 0.07696 106 
z 7 12.5 


5% 26° 27 
and 
2.718 ] 0.009 7 1 4 1 
t dy = UY È _ 
Jaz s= | a7 + ro + ag | = 000664 454. 


Use these formulas and the first formula of Problem 2 to obtain the approximation 
log 2.718 = 0.99989 633. 


Remark: With a little skill and a desk calculator that makes divisions, it is not 
difficult to extend these calculations to obtain good approximations to the number 


e = 2.71828 18284 59045 


. el 
for which f. x dx = | and loge = 1. Better ways to approximate logarithms 


and e will appear later. 
4 Someday we will learn the formulas 


1 ey 1 l _ 7 
[meer x te, J, pep & = F = 0.78539 81634. 


Use the Simpson formula to find approximations to the last of these integrals, 
and find the errors in the approximations, to obtain the numbers in the first two 
or three rows of the following table. 


n Simpson value Error 
2 . 78333 332 .00206 484 
4 . 78539 212 .00000 604 
6 .78539 782 .00000 034 
8 . 78539 802 .00000 014 
10 . 78539 809 .00000 007 
12 .78539 812 .00000 004 
14 .78539 812 00000 004 
16 78539 809 .00000 007 
18 .78539 812 .00000 004 
20 . 78539 809 .00000 007 
40 . 78539 789 .00000 027 
60 .78539 782 .00000 034 
80 .78539 782 .00000 034 
100 .78539 769 .00000 047 
200 . 78539 769 .00000 047 
400 .78539 569 .00000 247 
600 .78539 465 .00000 351 
800 .78539 425 .00000 391 
1000 . 78539 395 .00000 421 
10000 .78535 725 .00004 091 
15000 . 78535 442 .00004 374 
20000 . 78535 265 .00004 551 


100000 . 78499 059 .00040 757 
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Remark: The last cases show results obtained from an electronic computer that 
makes 8D calculations. When n is large, rounding errors seriously affect the 
last digit or digits kept. 

5 A loaded freighter is anchored in still water. At water level, the boat is 
200 feet long and, foreach k = 0,1,2,°°°, 20, has breadth yx at distance 104 
feet from the prow. Assign semireasonable numerical values to the numbers 
yp and do not allow anyone to claim that you have not partially designed a boat. 
Then use Mr. Simpson’s idea to approximate the area of the water-level section 
of your boat. Finally, recall an exploit of Archimedes and make an estimate of 
the number of tons of freight that should be removed in order to raise your boat 
1 foot. 

6 Use the Simpson formula to obtain decimal approximations to the follow- 
ing integrals. Keep two and three decimal places in the calculations, use a 
slide rule if possible, and use the value of n given in parentheses. 


(a) h x? dx, (n = 4) (b) h x3 dx, (n = 4) 
(c) im ; dx, (n = 2) (a) in sin x dx, (n = 6) 
(e) [ Vine de, (n = 6) nf EE ax, m= 6) 
(z) in (1 + x2) dx, (n = 4) (h) h e-z? dx, (n = 10) 
i) Í, ” a= dy, (n = 10) G) Í, * r= dx, (n = 4) 


7 Using the fact that x? > 3x when x > 3, show that 


8 Using the notation and ideas employed to derive (4.953), prove that if the 
graph of the function f for which 


(1) f(x) = K(x — x) + A(x —m)? + Blax) 


contains the three points Po(xo, yo), Pi(x1, y1), Po(x2, y2), then 


(2) SZ He) de =F lyo + Ay + yd. 


Remark: This result shows that the error term is zero and the Simpson formula 
gives the exact value of the integral when f is a polynomial of degree three or less. 
Thus we catch the idea that the Simpson formula gives good approximations even 
when the integrand cannot be closely approximated over the intervals x, Sx £ 
x%4.2 by quadratic polynomials but can be closely approximated over the intervals 
by cubic polynomials. Further investigation shows that if we add to the right 
member of (1) an integrable term $(x) for which |¢(x)| £ M(x — x1)4, then (2) 
will contain an error term e for which |e] < ($) MA‘. 
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9 An application of (2) of Problem 8 gives the famous old prismoidal formula 
H 
3) y= 2B + 41M + [Bol 


for volumes of solids. To investigate this matter, put xo = a and x = a + Hin 
(2) to obtain 


(4) fo" s@ ae = 2 [no + (042) 440+ 0], 


If a reasonably decent solid has bases in the planes having the equations x = a 
and x = a + H, and if for each x’ for which a S$ x’ S a+ H the plane having 
the equation x = x’ intersects the solid in a plane region having area f(x’), then 
the left member of (4) is the volume V of the solid. The quantity in brackets in 
(+) is the sum of the area |B,| of one base By, the area |B.| of the other base Ba, 
and four times the area |M| of the section M midway between the two bases. 
Thus the formula (3) is correct when the solid has volume V equal to the left 
member of (4) and f(x) has the form 


f(x) = Kix? + Kax? + Kax + Ky 


Nearly everyone acquires substantial respect for the prismoidal formula when it 
is discovered that the formula yields the correct formula for the volume of a 
spherical ball of radius a. In this case H = 2a, the bases are points having area 0, 
and the midsection M is an equatorial disk having area ra?. 

10 While the matter cannot be fully explored in a course in elementary 
calculus, we can know that persons who study Lebesgue measure and integration 
may learn that E; contains sets much queerer than those considered in this book. 
It can happen that each plane section perpendicular to the x axis is a square of 
unit area so that (in the context of Problem 9) f(x) = 1 when 0 <£ x < 1, but, 
nevertheless, the squares are so heterogeneously scattered that the set fails to 
possess a volume. For such queer sets the prismoidal formula is invalid because 
the left member of (4) of Problem 9 is not the volume of the set. Experts in the 
theory of measure can have sympathy for students of solid geometry who are a 
bit mystified by the “Cavalieri theorem.” This “theorem” says that two sets in 
E; have equal volumes if they have parallel bases and equal altitudes, and if each 
plane parallel to the bases intersects the two sets in two plane regions having 
equal areas. The queer sets which we have mentioned show that the “theorem” 
is false. Appendix 2 at the end of this book shows how we can reconcile ourselves 
to these matters. Some of us will learn more about these things than others, but 
we can all know that there is much to be learned. 


Functions, 


graphs, 


and numbers 


5 


5.1 Graphs, slopes, and tangents Itis quite possible that we first 
heard about tangents, or tangent lines, when we were very young. We 
may have been shown a circle as in Figure 5.11 and have been solemnly 
told that some lines in the plane of the circle intersect the circle twice, 
some others do not intersect the circle at all, and some others, the tan- 
gents to the circle, intersect the circle just once. When graphs more 
complicated than circles appear, no such simple story can adequately 
describe tangents. For example, the line T of Figure 5.12 intersects the 
graph twice and seems to be tangent to the graph at Po, while the line 
L intersects the graph only once and does not seem to be tangent to the 
graph. To attack this rather delicate matter, we start with a given 
function f defined over some interval and draw the graph G of y = f(x) as 


in Figure 5.13. We next select an x within the domain of f and call it 
284 
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yY 
P(xo + AX, Yo + Ay) 


T T L 
T 
Po(Xo» Yo) 
Po 
Xo xz 
Figure 5.11 Figure 5.12 Figure 5.13 


xo to emphasize the fact that it remains fixed throughout our discussion. 
Our task is to try to decide what we should mean when we say that 
a line T is tangent to G at P(xo,yo). We gain the possibility of making 
progress when we choose a number Ax for which Ax = 0, plot the point 
P(xo + Ax, yo + Ay) on G, and draw the chord joining our two points on 
G. Our first feeble idea can be that T is tangent to G at P(xo,yo) if the 
chord is nearly coincident with T whenever Ax is near zero. We can, so 
far as nonvertical tangents are concerned, improve this idea to gain the 
concept that the line T through P(%o,yo) having slope m is tangent to G 
at P(xo,yo) if the slope Ay/Ax of the chord is near m whenever Ax is near 
0. We know how to express this concept in terms of limits and deriva- 
tives, and we do it in the following definition. 

Definition 5.14 If f'(xo) exists, then the line T through the point (x0,yo) 
having slope f’ (xo) is said to be tangent to the graph of y = f(x) at the point 
(xoy. If f’ (xo) fails to exist, then the graph fails to possess a nonvertical 
tangent at the point (xo,yo). 

From this definition and the point-slope formula for the equation of a 
line, we obtain the following theorem. 

Theorem 5.141 Iff (xo) exists, then the equation 


y — yo = f'(x) (x — xo) 


is the equation of the tangent to the graph of y = f(x) at the point (xo,yo). 

To assist in the development and communication of ideas, it turns out 
to be exceptionally useful to agree that if a graph has a nonvertical 
tangent at a point (%o,yo), then the slope of this tangent will be called 
the slope of the graph at the point (%0,yo). In accordance with this idea, 
we adopt the following definition. 

Definition 5.15 If f’ (xo) exists, then f'(xo) is said to be the slope of the 
graph of y = f(x) at the point P(xo,yo). 

In order to obtain a full understanding of tangents to graphs, and for 
other purposes, it is helpful to know about “lines of support” of graphs 
and other point sets that lie in a plane. We confine attention here to 
cases in which f is a continuous function defined over a S$ x < b and 
Po(x0,yo) is a point on the graph of y = f(x) for which a < xo < b. A 
line L through Po is said to be a line of support of the graph of y = f(x) 
if there is a positive number 6 such that the part of the graph of y = f(x) 
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for which x) — ô < x < xo + ô lies entirely on or above L or lies entirely 
on or below L. To emphasize that tangent lines were defined as we 
defined them because of custom and not because of logical necessity, 
we can imagine that a man from Mars might come to our earth with a 
language identical with ours except that his meanings of the terms “line 
of support” and “tangent line” could be obtained by interchanging ours. 
This man from Mars might wonder why on earth we study our tangent 
lines instead of his. The problems at the end of this section may provide 
reasons. 

To be honorable, we must show that the remark made in Section 3.7 
about tangents to curves is in agreement with the ideas of this section. 
Putting z(t) = 0 gives the assertion that if r(ż) is the vector OP running 
from the origin to a particle P which traverses a curve C as t increases, 


and if 
(5.16) r(t) = x(#)i + yj, 


where x and y are differentiable functions of t for which r’(t) = 0, then, 
for each ż, the vector 


(5.161) rd) = «i+ y Oj 


is tangent to the path. In case the particle P always lies on the graph 
of the equation y = f(x), we always have y(t) = f(x(t)). Therefore, 


(5.162) r(t) = x(t)i + f), 

and differentiating with the aid of the chain rule gives the result that, 
at each time #, the vector 

(5.163) r(e) = x Hi + fee) 


is tangent to the graph. The hypothesis that r’(t) = 0 implies that 
x’(t) #0. Since x’(#) is a nonzero scalar, our result is equivalent to the 
statement that, for each x, the vector 


(5.164) it fj 
F ow, , . 
having its tail at the point (x,y) on the graph is 
(x,y) 3 ; . 
tangent to the graph at the point. With or with- 
Figure 5.165 out the aid of Figure 5.165, we can see that this 


vector lies on the line through (x,y) having slope 
f'(x). Thus the tangent line obtained by use of vectors is the same as 
the tangent line obtained by use of slopes. 

The remainder of the text of this section is devoted to a useful theorem 
which is, from our present point of view, thoroughly difficult. The 
theorem is important because it gives precise information that is very 
often used. The proof of the theorem shows that we must learn more 
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mathematics before we can fully comprehend the details. In the worst 
of circumstances, we are like a person who cannot swim but is thrown into 
the water and given a chance to fight for his life. Most of us will soon be 
swimming around in the scientific oceans, and Theorem 5.17 will slowly 
metamorphose from an ugly demon to a friendly angel. The theorem is 
closely related to the preceding paragraph and to the chain rule, but it is 
different from both. Using different notation, it sets forth conditions 
under which the first of the two equations 


x = fi), y = folt) 


can be ‘‘solved” for ż and the result substituted in the second equation to 
obtain y as a function of x. Moreover, the theorem tells how we can 
find a formula for the derivative of y with respect to x even though we 
cannot work out a useful formula that gives y in terms of x. The useful- 
ness of the theorem and the difficulty of the proof are both due to the 
fact that the conclusion of the theorem guarantees existence of various 
things. If we replace the condition x(t) > 0 by the condition x’(t) < 0 
in the theorem, the intervening details become somewhat different but the 
final conclusion (5.171) is valid. We could say that the theorem is a 
theorem about elimination of parameters, but in case fo(t) = tsot = y it is 
an inverse-function theorem. 

Theorem 5.17 Let x(t) and y(t) be continuous over the closed interval 
ti StS te and be differentiable over the open interval tı < t < te and let 
x(t) > 0 when ty <t<te Let x(tı) = a and x(t.) =b. Then a <b, 
and to each xo for which a < xo < b there corresponds exactly one to for 
which tı < to < te and x(to) = xo and to in turn determines exactly one 
yo for which yo = y(to). This correspondence between numbers xo and yo 
determines a function f for which yo = f(xo) when a < xo < b, and hence 
y = f(x) when a < x <b. Moreover, this function f is differentiable and 
the first of the formulas 


dy 

xy LO dy _ at 

(5.171) f'(x) = (ty T de 
dt 


is valid when x = x(t) and tı < t < te. The second is also valid when tt is 
understood to mean what the first does. 

To help us understand the things we do 
to prove this theorem, we start sketching 
Figure 5.172. We mark the points (#,a) 
and (t2,b) in a tx plane. For a schematic 
graph of x(t), we sketch a curve headed 
upward to the right because we think it 
should be so because x(t) > 0. Theorem 


Figure 5.172 
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5.27, which we do not bother to read now, proves that this idea is correct 
and that a < b. We next mark xo such that a < xo < b. We can 
easily believe that the rising graph of x(t) must intersect the dotted 
horizontal line at exactly one point (to,%0). Theorems 5.48 and 5.27 prove 
that this is correct, and we now have %. Our given function y, of which 
we have not sketched a schematic graph, then determines the number yp 
defined by yo = y(to), and we put yo = f(xo) in 

y Figure 5.173. This gives one point on the graph 
of y = f(x), and the same procedure gives each 


fe) other point on the graph. We now have the 
formula y(t) = f(x(t)). If we had proof that f 

is differentiable, we could apply the chain rule 

ol ax b ~ to obtain y’(t) = f'(x(t))x’ (t) and divide by x’(#) 
Figure 5.173 to get our answer, but this will not work because 


we do not yet have the required proof. We 
therefore start a direct attack upon difference quotients by taking a 
fixed ż for which tı < t < t and writing 


(5.174) f(x + At)) — f(*@)) = ya + At) — yC). 
Dividing by x(t + At) ~- x(t) gives the more promising formula 
cars) ‘Sexe +.) = fe) _ ye +A) ~ 9 


xt +A) -xA x(t + At) — x(t) 


Since y’(t) and x’(t) both exist and x’(z) = 0, we can divide the numerator 
and denominator of the right side by Az and see that the right side has the 
limit y’(t)/x’(t) as At— 0. The left side therefore has the same limit 
and we obtain 

_ | f(x(t + At)) — flx(t)) | _ y(t) 
(5-176) jair BATEL Fa) TA 


This seems to be almost the desired result (5.171), but we must use it to 
obtain additional information. Let e> 0. Choose a positive number 
5, such that 


(5.177) fat + At) — fe) _ rO| 


x(t + At) — x(t) x! (t) 


whenever |Az| < 6,. Another appeal to theorems given later in this 
chapter shows that there is a positive number ôs such that when |k| < 4, 
there is a number Aż for which |At| < 6; and 


(5.178) x(t + At) = x(t) +h. 
It follows that 

fa +4) —f(@®) y 
(5.179) ry Sr) <e 
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whenever |4| < 62. This gives (5.171) and completes the proof of 
Theorem 5.17. 

As was remarked, the proof of Theorem 5.17 is difficult because 
existence of various things must be proved. To help us understand that 
questions involving existence and differentiability of functions can be 
significant, we look at an example. Let us assume that y is a differen- 
tiable function of x for which 


(5.18) x? + y? + sin x + sin y + 46 = 0. 


Differentiating with respect to x with the aid of the chain rule then gives 


(5.181) 2a + 2y FË + cos x + cos yZ = 0 
or 

dy 
(5.182) (2y + cos y) = = — (2x + cos x) 


and, when (2y + cos y) = 0, dividing by (2y + cos y) gives a formula for 
dy/dx. The formula is illusory, however, because the original assump- 
tion is incorrect. The inequalities x? 20, y? > 0, sin x 2 —1, and 
sin y 2 —1 imply that, whatever x and y may be, 


(5.183) x? + y? + sin x + sin y + 46 2 44. 


Consequently, there are no numbers x and y for which (5.18) is true. 
The assumption that there is a differentiable function f, defined over some 
interval a < x < b, such that 


(5.184) x? + [f(x)] + sin x + sin f(x) +46 =0 (a<x<b) 


is false. This example can help us understand the nature of Theorem 
5.17. The theorem is not a weak one which tells what dy/dx must be if 
it exists. The theorem sets forth conditions under which dy/dx must 
exist and gives a formula which must be correct when these conditions 
are satisfied. Proof of a weak theorem can be obtained by mixing a few 
words with the calculation 


Ay dy 

dy Ay p At _ dt 

(5.185) Je T al ae T A Ar de 
A dt 


but this one line is very far from the equivalent of a theorem which sets 
forth conditions under which y is a differentiable function of x and the 
formula is valid. Examples show that matters involving (5.185) are 
not always completely simple. The distance r from Earth to Mars and 
the blood pressure p of a particular yogi are both functions of time t, but 
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the unqualified assertion that one of r and p is a differentiable function of 
the other is quite dubious. 

Returning to simpler considerations, we note that if the graph of a 
function has a tangent line at a point P on the graph, then the line through 
P perpendicular to the tangent is called the normal to the graph at P. 


Problems 5.19 


1 Find the equation of the line tangent to the graph of the given equation 
at the given point 


(a) y = x?, (1,1) Ans: y — 1 = 2(% — 1) 
(b) y = sin 2x, (0,0) Ans.: y = 2x 
(c) y = x log x, (1,0) Ans: y =x — l 
(2d) y = e7, (0,1) Ans.: y =ax + 1 
(e) y = sin x, (0,0) Ans: y =0 
(f) yu*« COS x, (27,27) Ans.: y =x 
(e) y = (x + x?)$, (1,32) Ans.: y = 240x — 208 


2 Find the equation of the tangent to the graph of the equation y = x" 
at the point (x1,x7).  Ans.: 


Tiy — (n — 1)x?. 


y = nxt 

3 First find the slopes of the graph of the equation y = x? at the points for 
which x = —l, x = —4, x = 0, x = 4, and x = 1. Use this information to 
help construct a figure showing the graph and five tangents. 

4 Find the area of the region bounded by the graph of y = x? and the tangent 
to this graph at the point (1,1). Ans.: 32. 

5 Even a crude graph suggests that at least one line can be drawn through 
the point (—2,—3) tangent to the graph of the equation y = x2 + 2. Investi- 
gate this matter. 

6 Sketch reasonably accurate graphs of y = sin x, y = x, and y = —x over 
the interval —2r SxS 4r. Let 


f(x) =x sinx 


and, after observing that f(x) = 0 when sin x = 0, f(x) =x when sinx = 1, 
and f(x) = —x when sin x = —1, sketch a graph of f(x). It is easy to guess 
that the graph of y = f(x) is tangent to the graph of y = x wherever sin x = 1 
and that the graph of y = f(x) is tangent to the graph of y = —x wherever 
sin x = —l. Provethatitisso. Hint: Calculate f'(x) and observe that cos x = 
O wherever sin x is 1 or —1. 

7 As we know, the part of the graph of the equation 


y = ya? — x? = (a? — x2)¥ 


for which —a < x < a is an“ upper semicircle” with center at the origin. Let 
Po(xo,0) be a point on this graph. Use definitions or theorems of this section to 
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prove that the graph has exactly one tangent at Po and that the equation of this 
tangent is 


y — yo = — 22 (x — xo). 
Yo 


Prove that this line is perpendicular to the line joining the origin to Py and hence 
that the definition of tangent given in this section is in agreement with ideas of 
tangents employed in elementary plane geometry. 

8 With or without more critical investigation of the matter, shetch a figure 


which indicates that the graph of the preceding problem has exactly one line of 
support at Po. 


9 Ifx = acostand y = asin ż it is easy to make the calculation 


dy dt _ acost 


dx dx —asint 


over each interval of values of ¢ for which sin ż > 0 or sint < 0. Letting xo = 


a cos to and yo = a sin to, we find that the equation of the tangent to the graph 
at the point Po(xo, Y0) is 


a COS to (x — 


— 0 = - : 
y y a Sin to 


x 
xo) or y— yo = — yo (x — xo). 
0 


Sketch a graph which shows the geometric interpretations of these things. 

10 Find the equation of the tangent to the graph of y = x3 at the origin. 
Sketch the graph and show that it does not have a line of support at the origin. 

11 Draw a graph of the equation y = |x|. Show that this graph has no 
tangent at the origin but does have many lines of support. Remark: Our word 
“tangent” has its root in a Latin verb meaning “‘to touch,” and a mathematician 
from Mars can defend his contention that our lines of support are “touching 
lines” and hence should be called tangents. We must, however, stick to our 
guns and insist that, in languages used on earth, these lines are not tangents. 

12 Sketch the graph of y = sin x and the normal to the graph at the point 
(x, sin x). The normal intersects the x axis at the point (f(x), 0). Determine 
whether f(x) increases as x increases. Hint: Borrow, from the next section, the 
unsurprising fact that f(x) is increasing over an interval if f'(x) > 0 over the 
interval. 

13 In connection with Problem 12, we note that problems in applied mathe- 
matics sometimes involve extraneous material that may obscure their mathe- 
matical aspects. A witch with a broom sweeps the x axis while walking along the 
graph of y = sin x in such a way that x is always increasing. She heeps the 
handle of her broom perpendicular to her path. Is the broom always pushing 
dust to the right? 

14 The two formulas 


d e Ld 
— sin * = cos %, —— COs x = — SIn x 


dx dx 
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say something very specific about the slopes of graphs of y = sin x and y = cos x. 
Sketch these graphs and observe that the formulas seem to be correct. 

15 Let x = L cos? 6, y = L sin? 6, where L is a given positive constant. 
Find the equation of the tangent to the graph 
at the point for which @ = ĝe and show that this 

tangent intersects the x and y axes at the points 
FERNER A(L cos 8o 0) and B(0,L sin 80). Show. that 
|4B| = 

16 Let x = L cos? 0, y = L sin? @ as in the 
preceding problem. Show that 


x% + y% = L% 


and use this to find equations of tangents and to 
= L of the preceding 
problem. Remark: The graph of these equations 
is, as we shall see later, a hypocycloid of four cusps. 
It appears in Figure 5.191. 

17 We now solve a problem that is similar to Problem 11 of Section 3.6. 
It is a rather tedious task to draw a graph of the equation 


Figure 5.191 


(1) x8 — x%y — 2x — 7x3 + yê = 721 


unless we have an electronic computer to help us dothe chores. The graph does 
contain the point Po(2,3), the constant 721 having been so determined that this 
is so. Our problem is to find the equation of the tangent (if any) to the graph 
at Po. Without being sure about the facts, we assume that there is a function 
ġ, defined over some interval 2 — 6 < x < 2 + 6, such that the part of the 
graph near Po has the equation y = ¢(x) and, moreover, ¢ is differentiable. 
Then (1) holds when y = $(x) and, with the aid of our formula for differentiating 
products of differentiable functions of x, we differentiate the members of (1) and 
equate the results to obtain 


dy dy 
5 2 Fy _ —?/— 2 way 
(2) 6x5 — x 7 2xy — 2 — 21x? + 6; T 0 
or 
(3) dy _ _ 6x5 — Ixy —2 — 21x? 

dx 6y5 — x? 


At the point (2,3) this has the value ~?8q. The required equation of the tangent 
line is 


(4) y — 3 = ri8e(x — 2), 


provided, of course, that our assumption is correct. 
18 Apply the method of the preceding problem to find the slope of the graph 
of the equation 
x? + y? = 25 
at the point (3,4). 
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19 [Itis easy to show that the graph of the equation 


(1) xt + 2x*y? + yt — 2x8 — Ix*y — 2xy? — 2y3 
+ 5x? + 5y? — 6x — 6y +6 =0 


contains the point (1,1). What have we learned that could make us sure that 
the graph contains another point? Ans.: Nothing. Remark: We do not yet 
have enough mathematical equipment to enable us to answer basic questions 
about natures of graphs of complicated equations. One who has or develops 
interest in such matters must continue study of calculus. Problem 7 and the 
following problems at the end of Section 11.3 provide reasons why profound 
study of graphs should follow (not precede) study of calculus. While the opera- 
tion gives no information about the natures of graphs of other equations, one 
who cares to do so may show that (1) can be put in the form 


(2) [x = 1)? + (y DIe + y? + 3] = 0 


and hence that the point (1,1) is in fact the only point on the graph. 
20 Let fbe the function for which f(0) = 0 and 


f(x) = x? sin $ 


when x Æ 0. Prove that f’(0) = 0 and hence that the x axis is tangent to the 
graph of f at the origin. Sketch the graph of f and tell why the x axis is not a 
line of support of the graph. Hint: To calculate /’(0), use the fact that 


f(~) — £0) 
x —0 


x sin | £ Fl (x = 0) 


and apply the sandwich theorem. 

21 When we study a science, it is sometimes worthwhile to obtain preliminary 
ideas about machinery that we are not yet prepared to understand fully. This 
is an example which involves curves and tangents. Let S be a set of points in a 
plane (in E2) which is bounded (this means that there is a rectangle which con- 
tains S), is convex (this means that if Pı and P2liein S, 
then the whole line segment joining Pı and Pzelies in S), 
and which contains at least one inner point (this means 
that there is a point P in S and a positive number 6 
such that S contains each point inside the circle with 
center at P and radius 6). Figure 5.192 shows an 
example. Let T (capital gamma) be the boundary of 
S; a point Q is a point of Tif each circular disk with 
center at Q contains at least one point in S and also 
at least one point notin S. We can wonder whether Figure 5.192 
T should be called a curve. We can observe that 
there may be points, such as 4, B, C, D in the figure, at which [ has many 
lines of support but has no tangent. We can observe that there may be 
points, such as Æ in the figure, at which T has only one line of support and 
has a tangent. We can say that I has a corner at a point B if B is on 
T and T has more than one line of support at B. We can wonder whether T 
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has a tangent at each point where it does not have a corner. We can wonder 
whether we can associate angles with corners in such a way that, whenever we 
take a finite set of corners, the sum of the angles at these corners must be less 
than or equal to 27. We can wonder whether it is possible to construct a set S 
such that I’ has a corner at each of its points. We can ask questions much 
faster than we can answer them. We can conclude that we have some very 
substantial and useful information about tangents, but we do not yet know 
everything. 


5.2 Trends, maxima, and minima Everyone who knows what it 
means to say “it has been getting hotter all morning” or “the temperature 
has been increasing all morning” should easily comprehend the following 
definition. In most of the applications we shall meet, the set S will bea 
set in E (that is, a set of real numbers) which is either (i) the whole set 
of real numbers or (ii) the set of numbers in a closed intervala Sx Sb 
or (iii) the set of numbers in an open interval a < x < b. However, S 
can be the set of positive integers or any other set in which we may be 
interested. 

Definition 5.21 A function f is said to be increasing over a set S in Ey 
if f(%1) < f(x2) whenever xı and xz are two numbers in S for which xı < xa. 
The function is said to be decreasing over S if f(%1) > f(x2) whenever xı and 
x are two numbers in S for which xı < xə. 

The following definition is more subtle. If the temperature was 30° 
from 10:00 a.m. to 11:00 A.M., an articulate and truthful person would not 
be expected to say that the temperature was increasing from 10:00 a.m. 
to 11:00 a.m. However, in accordance with the following definition, the 
temperature might have been monotone increasing all morning. 

Definition 5.22 A function f is said to be monotone increasing over a 
set S in Ey tf f(x) < f(x%e) whenever xı and x are two numbers in S for 
which xı < xe. The function 1s said to be monotone decreasing over S if 
f(x) 2 f(x) whenever xı and xz are two numbers in S for which xı < xə. 

The terminology in this definition is very useful, and it may seem to be 
less than utterly foolish when we realize that f is called monotone (some 
people have preferred the word monotonous) if it is either monotone 

increasing or monotone decreasing. For 

example, the function f having the graph 

S N shown in Figure 5.23 is increasing over 

o © Ə the interval a S x S xı, is decreasing 

a “1 *2 b over the interval x S x < b, is mono- 

Figure 5.23 tone increasing over the interval 

a= x S x and is monotone decreasing 

over the interval xı S x < b. To appreciate the necessity for the fol- 

lowing definitions, it may be sufficient to realize that it is impossible to 

be quite sure what is meant when someone says that “the temperature at 
Pike’s Peak reached a maximum at noon last Friday.” 
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Definition 5.24 Let f be defined over a nonempty set S in Ey. We say 
that f has a local (or relative) maximum over S at x, and that f(%o) is a local 
maximum of f over S, if there is a positive number h such that f(x) S f(xo) 
whenever x isin S and |x — xo < h. We say that f has a global maximum 
(or absolute maximum) over S at xo if f(x) S f(xo) whenever x is in S. A 
local minimum and a global minimum are similarly defined, the relation 
f(x) < f(xo) being replaced by f(x) = f(x). 

Applications of these definitions can be quite diverse. For example, 
f(x) might be the number of telephones ringing in Chicago x hours after 
the beginning of the nineteenth century and S might be the whole set of 
positive integers or any other set of numbers we wish to select. For 
many purposes it suffices to see how these definitions are applied when 
S is an interval and f is differentiable over the interval. Let f be the 
function whose graph is shown in Figure 5.25. Assuming that there is 


(e) (e) 
xı Xo X3 X4 b x 


Figure 5.25 


nothing deceptive about the graph, we can see that f is increasing over 
the intervals a S x S x, and xe S x < xand that f is decreasing over the 
intervals xı S x S xz and x4 Sx Sb. Supposing that f'(x) =0 so 
that the graph has a horizontal tangent at the point (x3, f(x3)), it appears 
that f has local maxima at xı and x4, a global maximum at x4, local 
minima at a, x2, and b, and a global minimum at a. There is neither a 
local minimum nor a local maximum at x3 even though f'(x) = 0. We 
have described the trends (the increasings and the decreasings) and the 
extrema (the maxima and minima) of f. 

Sometimes we are required to obtain information about a function f 
when we do not have a graph of f but do have a formula which determines 
values of f(x) for different numbers x. As the discussion of Figure 5.25 
indicates, it is often quite impossible to give precise information about a 
differentiable function f until we have found the values of x for which 
f'(x) = 0. These are the values of x for which the graph of f has hori- 
zontal tangents, and they are called critical values of x. After we succeed 
in finding when f'(x) = 0, when f'(x) > 0, and when f'(x) < 0, we may 
find it convenient to construct a figure 
more or less like Figure 5.251 in which we Figure 5.251 
(i) mark the points at which f'(x) = Qand $44 ==- 44h $44 -= 
the graph of f has horizontal tangents, (ii) a xX xy 4 b 
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put plus signs above intervals over which f'(x) > 0 and the graph of f 
has positive slope, and (iii) put minus signs above intervals over which 
f'(x) <0 and the graph of f has negative slope. Information about f 
can then be obtained with the aid of the two following theorems. 

Theorem 5.26 If a < x< b and if f’(xo) exists, then f cannot have a 
maximum ora minimum over the interval a S x S b at xo unless f'(x) = 0. 

Theorem 5.27 If f is continuous over an interval ao S x S bo and 
F(x) > O when ao < x < bo, then f is increasing over the interval 

ao < xs bo. 


If f is continuous over an interval ao S x S bo and f'(x) < 0 when 


ao <L x < bo 


then f is decreasing over the interval ao S x S bo. 

The second of these theorems is much more forthright and potent than 
the first. It will be proved in Section 5.5. The first theorem says that if 
a < xo < band f'(x) > 0 or f'(x) < 0, then f cannot have even a local 
maximum or a local minimum at xo To prove this, we suppose first 
that a < xo < b and f'(x) = p, where is a positive number. Then 


lim f(xo + Ax) — f(%o) = 
âz—0 Ax 


p 


and we can choose a positive number 6 such that 


axx—-8<x+h<d 
and 
f(xo + Ax) — f(x) 5? 
Ax 2 
whenever 0 < |Ax| < 6. If 0 < Ax < 6, then the denominator of the 
above quotient is positive, so the numerator must also be positive and 
f(x) < f(xo + Ax). If —ô < Ax < 0, then the denominator of the dif- 
ference quotient is negative, so the numerator must also be negative and 
f(%o + Ax) < f(x). Thus if x9 — ô < xı < xo < x2 < xo + ô, then 
f(x) < f(x) < f(x:2), so f cannot have either a local maximum or a local 
minimum at xo. In case a < xo < b and f'(x) < 0, a similar argument 
shows existence of a number ô such that if xo — ô < xı < xo < x < 
xo + ô, then f(xı) > f(x) > f(x2) and f cannot have a local maximum or 
a local minimum at xo This completes the proof of Theorem 5.26. 
It is quite as important to know what Theorem 5.26 does not imply as it 
is to know what the theorem does imply. It does not imply that f has 
an extremum (a maximum or a minimum) any place and it does not imply 
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that f has an extremum at xo. It does imply that if f has an extremum at 
xo, then xo must be either 


(i) one of the end points a and b or 
(ii) such that f’(xo) does not exist or 
(iii) such that f'(x) = 0. 


The points xo in these three categories are therefore the only ones that 
need be examined when we are seeking extrema of f over the interval 
a=x5b. This information, meager as it is, is often helpful. Figure 
5.271 may help us to understand it. The following theorems, which are 


l 

{ 

! 

J 
a c d e f g b x 
Figure 5.271 


easily interpreted in terms of Figure 5.25 and which are easy consequences 
of Theorem 5.27, give all the information we need to solve many problems. 

Theorem 5.28 (maximum) If f is continuous over a < x < b, if 
a < xo < b, and if there is a positive number h such that f'(x) > O when 
xo — h <x < xo and f'(x) <0 when xy < x < xot h, then f has a local 
maximum (which may be a global maximum) at xo. 

Theorem 5.281 (minimum) If f is continuous over a < x < b, if 
a < xo < b, and if there is a positive number h such that f'(x) < O when 
xo — h <x < xo and f'(x) > 0 when xo < x < xo +h, then f has a local 
minimum (which may be a global minimum) at xo. 

Theorem 5.28 says, in slightly different words, that if we travel a 
smooth road in such a way that we go uphill from 8:58 a.m. to 9:00 a.m. 
and go downhill from 9:00 a.m. to 9:02 a.m., we are atop a hill (but not 
necessarily atop the highest mountain) at 9:00 a.m. Theorem 5.281 has a 
similar interpretation. 


Problems 5.29 
I Letting f be defined over F, by the formula f(x) = x? — 2x + 3, show that 
f'(x) = 2x — 2 = 2(x — 1). 


Observe that f'(1) = 0, and then make the more profound observation that 
f'(«) <-0 and f is decreasing when x < 1 and that f'(x) > 0 and f is increasing 
when x > 1. Show that f(1) = 2 and use the information to sketch a graph of 
y = f(x). Give all of the facts involving extrema (maxima and minima) of f. 
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2 Letting f(x) = ax? + bx + c where a > 0, show that 
; b 
f'(x) = 2a (+ + z) 


Tell why f is decreasing when x < —b/2a and increasing when x > —b/2a. 
Show that f has a global minimum at the point 


(- b _ b? — tee) 
2a 4a f 


3 Letting f(x) = ax? + bx + c where a < 0, show that f is increasing when 
x < —b/2a, that f is decreasing when x > —b/2a, and that f has a global maxi- 
mum at —b/2a. 

4 Letting f(x) = 1/(1 + x?), show that fis increasing when x < 0, is decreas- 
ing when x > 0, and hence has a global maximum when x = 0. 

5 Show that the function f for which 


3 
fe) = TER 


is everywhere increasing and hence has no extrema. 
6 Show that 


is decreasing except when x = +1. 
7 Find all trends and extrema of the function f for which 


fe) = TER 


and sketch a graph of y = f(x). Hint: After calculating f'(x), put the result in 
the form 


wna . Ad +x) — x) 
TO) = TE ae 


and, after observing that f’(—1) = 0 and f’(1) = 0, find the sign of f'(x) over 
each of the intervals x < —1, —1 <x < l, and x > 1. Then find f(—1) and 
f(1) and make efficient use of this information. Find f'(0) and make the graph 
have the correct slope at the origin. 


8 Supposing first that x > 0, find the trends and extrema of the function 
f for which 


fle) =+ 


and sketch the graph of y = f(x). Then let x < 0 and repeat the process without 
use of symmetry, but use symmetry to check the results that are obtained. 

9 This problem requires us to think about making tanks from rectangular 
pieces of sheet iron. Starting with a rectangle 15 units wide and 24 units long, 
we cut equal squares from the four corners and fold up the flaps to form a tank. 
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Our first step is to draw Figure 5.291 and look at it. 
Our good sense should tell us that if the squares are 
small, then the tank will be shallow and the volume will 
be small. Taking larger squares should yield greater 
volumes unless we make the squares so large that the 
area of the base of the tank is small enough to overcome 
the advantage of making the tank deeper. To become Figure 5.291 
quantitative about this matter, we let x denote the 

lengths of the sides of the squares and ask how the volume X(x) of the resulting 
tank depends upon x. In particular, we want to know what x maximizes /'(x). 
Show that 


V (x) 


x(15 — 2x)(24 — 2x) 
4x3 — 78x2 + 360x 


and tell why x must be restricted to the interval 0 < x < 45. Show that 


V(x) = 12x2 — 156x + 360 
= 12(x — 3)(x — 10). 


Tell why V(x) is increasing when 0 < x < 3 and is decreasing when 3 < x < 45. 
Show that the maximum VF attainable is 486 cubic units. 

10 A sheet-iron tank without a top is to have volume VY. A rectangular 
sheet h feet high and 2rr feet long, costing A dollars per square foot, is bent and 
welded into a circular cylinder to form the lateral surface of the tank. A sheet 
2r feet square of different material, costing B dollars per square foot, is trimmed 
to form a circular base which is welded to the cylinder to form the tank. Find 
the radius and height of the tank for which the total cost T of the material (the 
total amount purchased, not merely the amount actually used in the tank) is a 
minimum. Ans.: 

[AV 1 3/6B27 
r= J 4B’ h = mN y aa 
Hint: Start by showing that 
T = 2rÁrh + 4Br? 


and then use the relation V = rr?h to express T in terms of just one of the vari- 
ables r and h. Standard methods may then be used to minimize T. 

11 Referring to Problem 10, find the radius and height of the tank for which 
the cost of the material actually used is a minimum. 

12 Referring to Problems 10 and 11, find the radius and height which mini- 
mize the cost of the material actually used in making 
a tank which has a top exactly like the base. 

13 A long rectangular sheet of tin is 2a inches 
wide. Find the depth of the V-shaped trough of 
maximum cross-sectional area (see Figure 5.292) that 
can be made by bending the plate along its central 
longitudinal axis. Ans.: a/+/2. 

14 After referring to Problem 13 and Figure 5.293, 
formulate and solve a problem involving construction 
of troughs having rectangular cross sections. 


Figure 5.292 
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15 Show that of all rectangles having a given area, a square has the least 
perimeter. 

16 Find the radius and height of the cone of greatest volume that can be 
made from a circular disk of radius a by cutting out (or folding over, as chemists 
do) a sector and bringing the edges of the remaining part together. 

17 An ordinary tomato can is to be constructed to have a given volume X, 
Determine the height k and radius r of the can for which total 
surface area is a minimum. 

18 As in Figure 5.294, the base and lateral surface of a 
solid right circular cone are tangent to a sphere of radius a. 
Find the height of the solid having minimum volume. 


Outline of solution: The height y and base radius r are related 


by the formula 
a r 


— 


y—=a s/y? 


which equates two expressions for sin 6. Squaring, solving 
for r2, and using the formula V = grr*y for the volume of the solid, we find that 


Figure 5.294 


The conditions of the problem require that y > 2a. If 2a < y < 4a, then 
dV/dy < 0 and F is decreasing. If y > 4a, then dV/dy > O and F is increasing. 
Thus V is minimum when y = 4a. 

19 Supposing that xı, x2, °° * , Xn are given numbers, find the values of x, 
if any, for which 


»» (x — xp)? 
k=1 


attains maximum and minimum values. 
20 The elementary theory of probability tells us that the number pn, defined 
by 


! 
Pn = ia bi p*(l — p) 


is the probability of exactly k successes in » trials when p is the probability of 
success in each trial. Supposing that n and k are given integers for which n > 0 
and 0 <S k S n, find the number p which maximizes p,,. Hint: Ignore the 
numerical coefficient and find the p which maximizes p*(1 — p)". Ans.: 
p = k/n. 

21 An observant senator observes that if he hires just one secretary, she will 
work nearly 30 hours per week but that each additional secretary produces con- 
versations that reduce her effectiveness. In fact, if there are v secretaries, x not 
exceeding 30, then each one will work only 


x? 


30 — 35 
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hours per week. Find the number of secretaries that will turn out the most 
work. Discussion and solution: If there are x secretaries, the number y(x) of 
hours of work done per week is 


2 3 
(1) y) = x (30 — $5) = 30 = F 


It is required that x be an integer in the interval 0 <x < 30, so there are only 
31 possibilities. We can calculate the 31 numbers y(0), y(1), > > - , y(30) and 
select the x which gives the greatest y(x). It is easier and more enlightening, 
however, to use some calculus. Forgetting momentarily that x is an integer 
number of secretaries, we observe that (1) defines y(x) for each real x. Differ- 
entiating gives 


He) ox xt 
(2) y) = 30-5 = 


Thus y’(x) > 0 and y is increasing when 0 S x < 1/300. Moreover, y(x) <0 
and y is decreasing when x > 1/300. Since 4/300 = 17.32, we see that y(x) < 
y(17) when 0 S x < 17 and that y(x) < y(18) when 18 < x < 30. Thus the 
answer is 17 if y(17) > y(18) and is 18 if y(18) > y(17). 

22 Asin Figure 5.295, a triangle is inscribed in a semicircular 
region having diameter a. Find the 0 which maximizes the area CaN 
of the triangle. Ans.: 0 = 1/4. z 

23 A printed page is required to contain A square units of Figure 5.295 
printed matter. Side margins of widths a and top and bottom 
margins of widths b are required. Find the lengths of the printed lines when 
the page is designed to use the least paper. Ans.: \/aA/b. 

24 Sketch a reasonably good graph of y = x? and then mark the point or 
points on this graph that seem to be closest to the point (0,3). Then calculate 
the coordinates of the closest point or points. Hint: Minimize the square of the 
distance from the point (0,$) to the point (x,x?). 

25 The strength (ability to resist bending) of a rectangular beam is propor- 
tional to the width x and to the square of the height y of a cross section. Find 
the width and height of the strongest beam 
that can be sawed from a cylindrical log 
whose cross sections are circular disks 
of diameter L. Ans. Width = L/+/3, 
height = +/2 L/ v3. 

26 The x axis of Figure 5.296 is the 
southern shore of a lake containing a little 
island at the point (a,b), wherea>0. A 9 Run z 
man who is at the origin can run r feet per Figure 5.296 
second along the x axis and can swim s feet 
per second in the water. He wants to reach the island as quickly as possible. 
Should he do some running before he starts to swim and, if so, how far? Partial 
ans.: He should run 


$ 


a — Fra? 
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feet if r > s and this number is positive. Investigation of the whole matter js 
not as simple as might be supposed. Hint: If x > O and the man runs from the 
origin to the point (x,0), we should be able to calculate (in terms of x and the 
given constants) the distance he runs, the distance he swims, and the total time 
T required to reach the island. 

27 Light travels with speed sı in air and with speed s2 in water. Figure 
5.297 can interest us in possible paths by which 
light might journey from a point 4 in the air to 
a point S on the surface of the water and then 
to a point W in the water. Show that the total 
time T is a minimum when the point S is so 
situated that the angle @; of incidence and the 
angle 62 of refraction satisfy the condition 


Figure 5.297 sin 6; _ sin 0z, 
S1 S92 


Remark: The above formula is the Snell formula, one of the fundamental formulas 
of optics. Phenomena such as the one revealed by this problem are of great 
interest in physics and philosophy. 

28 Asin Figure 5.298 a heavy object of weight W is to be held by two identical 
cables. A kind engineer tells us that the tension 
T in the cables is X Va? + x?/2x%. A solemn 
merchant tells us that the cost per foot of his 
cables is kT dollars, where T is the tension they 
will safely withstand. We must buy the cables, 
and we have a problem. Ans.: We buy 2+/2 a4 
Figure 5.298 feet of cable costing Wk/~/2 dollars per foot, so 

we need 2Wka dollars. 

29 Modify the preceding problem by supposing that the body must hang 
below the point which lies between 4 and B at unequal distances a and b from 
A and B. 

30 The lower free corner of a page of a book is folded up and over until it 
meets the inner edge of the page and then the folded part is pressed flat to leave 
a triangular flap and a crease of length L. Supposing that the page has width a, 
find the distance from the inner edge of the page to the bottom of the crease when 
L is a minimum and find the minimum L. Hint: Minimize L?. Ans.: a/4 and 
3 ~/3 a/4. 

31 Sketch the part G, of the graph of the equation 


1 
yrats 


that lies in the first quadrant and observe that the y axis and the line having 
the equation y = x are asymptotes of Gi. Someday we will learn that G, is a 
branch of a hyperbola and that the point V on G; closest to the origin is a vertex 
of the hyperbola. Find the coordinates of V and the distance from the origin 
to V. Ans.: (27%, (1 + +/2)2-%) and V2 + 2 4/2. 

32 A given circle has radius a. A second circle has its center on the given 
one, and the arc of the second circle which lies inside the given circle has length L. 
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Prove that L is maximum when an appropriate angle @ satisfies the equation 
cot 6 = 0. 

33 A spherical ball of radius 7 settles slowly into a full conical glass of water 
and causes an overflow of water. The glass has height a, and the lines on the 
surface of the glass make the angle 0 with the axis of the glass. Find the radius 
r for which the overflow is a maximum. Remark: This problem is famous because 
it is difficult enough to be remembered and discussed by those who have solved 
it. Solution: With or without careful scrutiny of other cases, we suppose that 
the ball is neither so small that it can be completely submerged nor so large that 
it will fail to be tangent to the glass when it is in its lowest position. Letting 
O, C, and B be the vertex of the _conical glass, the center of the ball, and the 
bottom of the ball, we see that |OC| = r csc 0 and \OB| = r(csc 0 — 1). The 
submerged segment of the ball has thickness h, where 


(1) = a — r(csc 6 — 1). 


The overflow (measured in cubic units) is equal to the volume V of this segment, 
and Problem 2 of Problems 4.59 shows that 


(2) V = grh?(3r — h) = r[h?r — $43]. 
Differentiation gives 

dV dh dh 
(3) Gare + tS — we S| 


-apg -2)] 


Using (1) and the formula for dh/dr calculated from it gives 


wW 
(4) <= a [a sin 6 — (sin 6 + cos 26)r). 


Since a sin 6 and (sin @ + cos 26) are positive when 0 < @ < 7/2, it follows that 
V is a maximum when 


(5) asin 6 


= sin 6 + cos 20 


34 When distances are measured in feet, the equation of the path followed 
by water projected from our fire hose is 


_ (1 + m?)x? 
y=mx— -0 


where m is the slope of the path at the nozzle which is located at the origin. Find 
the value of m for which the water will reach the greatest height on a wall 40 feet 
from the nozzle and find the greatest height. Partial ans.: One of the two answers 
is 9. 

35 Remark: The following big-government problem need not be taken too 
seriously; its purpose is to neutralize a problem involving a country that allowed 
its unemployed boomerang repairmen to starve to death. Determine the num- 
ber of officials that must be supported in a country containing n workers, and 
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use the result to determine the population of a Utopian country that minimizes 
the burdens which individual workers must bear. Hint: Information about 
officers in efficient productive organizations is not relevant, but Parkinson laws 


may be used. 
36 Find the minimum of the function F for which 


FA) = h [x? — (x + dP dx. Ans.: th 


5.3 Second derivatives, convexity, and flexpoints In Section 3.6 
we called attention to the connection between second derivatives and 
accelerations. This section shows how second derivatives can be used to 
obtain information about functions and their graphs. To begin the pro- 
ceedings, we look at Figure 5.31, which shows the graph of a function for 


a xı Xq Xg X4 Xs b zx 


Figure 5.31 


which the derivative (or first derivative) f'(x) and the second derivative 
(the derivative of the derivative) f” (x) exist when a <S x < b. Toget 
some ideas, we think of the graph as being a road in a vertical plane upon 
which we can travel from 4 toG, and we take the x axis to be at sea level. 
During the whole trip, we are always above sea level. The sign of f(x) 
gives us this information. At some times during the trip we are going 
uphill, and at other times we are going downhill. The sign of f'(x) gives 
us this information. As we travel from 4 to B, from C to D, and from 
E to F we are passing over depressions (or pits), and as we travel from 
B toC, from D to E, and from F toG we are passing over humps (or peaks). 
As we shall see, f” (x) is our source of information about these things and 
about points of inflection or flexpoints B, C, D, E, F at which slopes attain 
local extrema. 

The two following theorems are obtained by replacing f by f’ in Theo- 
rems 5.26 and 5.27. 

Theorem 5.32 If f' is differentiable over a < x < b anda < xa <b, 
then f cannot have a flexpoint at xo unless f" (xo) = 0. 

Theorem 5.33 If f' is continuous over an interval ap S x S bo and 
f”) > O when ao < x < bo, then f' is increasing over the interval ay S 
x S bo If f’ is continuous over an interval ay < x < by and f"(«) <0 


when ao < x < bo, then f’ is decreasing over the interval ay < x < bo. 
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The information contained in these theorems is sometimes very helpful 
when graphs of given functions f are being drawn. For example, Figure 
5.34 shows an application of the first part of this theorem; f’’(x) is positive 


f"(z) <0 


Slope increasing Slope decreasing 


f(x) >0 


Figure 5.34 Figure 5.35 


over an interval and f'(x), the slope, increases from —1 through 0 to +1 
as x increases over the interval. Figure 5.35 shows an application of the 
second part of the theorem; f’’(x) is negative over an interval and f'(x), 
the slope, decreases from 1 through 0 to —1 as x increases over the 
interval. Sometimes it is helpful to put ideas involving derivatives in the 
form 


” _@y _ d (dy _ dm _ dsilope 
(3.351) fe) = dx? dx (2) = dx dx 


The important thing to remember is that f” (x) is the derivative of f'(x) 
and that a positive second derivative implies an increasing first derivative 
and hence an increasing slope, and that a negative second derivative implies 
a decreasing first derivative and hence a decreasing slope. It is sometimes 
useful and even necessary to know about attempts to describe the dif- 
ferences between the arcs of Figures 5.34 and 5.35 in other words. The 
first runs through a depression and the second runs over a hump. The 
first bends upward and the second bends downward. The first is convex 
upward and the second is convext downward. In the first case, the 
chord joining two points on the graph lies above the arc joining the two 
points, and in the second case the chord lies below the arc. In the first 
case each tangent to the graph lies (at least locally) below the arc, and 
in the second case each tangent lies (at least locally) above the arc. 

The virtue of the following theorem lies in the fact that it is a “local 
theorem” which we can apply without determining signs of functions 
over whole intervals and which is therefore sometimes easier to apply 
than Theorem 5.28. l 

Theorem 5.36 If f'(x) = 0O and f’'(xo) > 0, then (as Figure 5.34 
indicates) f has a local minimum at xo. If f'(x) = O and f” (xo) < 0, 
then (as Figure 5.35 indicates) f has a local maximum at xo. 


t In mathematics and optics a point set (which might in some cases be a lens) 1s convex 
if it contains the line segment joining Pı and Pz whenever it contains P, and Ps. The set 
is sometimes said to be concave if itis not convex. When we say that a part of a graph in the 
xy plane is convex upward, we mean that the set lying above it is convex; we do not mean 
that the graph is a convex set. 
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To prove the first part of this theorem, let f’(xo) = 0 and let f” (xo) = b, 
where p is a positive number. Then 


lim f' (xo + Ax) ~~ f' (xo) = b. 


Az—0 Ax 


Let e = p/2. Then there is a positive number ô such that f' (xo + Ax) 
exists when |Ax| < 6 and 


flea + Ax) — fie) 2 
Ax 2 
whenever 0 < |Ax| < 6. But f’(%o) = 0, and hence 


f’ (xo + Ax) > 0 
Ax 


and therefore f' (xo + Ax) and Ax are both positive or both negative when- 
ever 0 < |Ax| < 6. When x < x < xo + ô, we can set x = xo + Ax 
and conclude that 0 < Ax < ô and f'(x) > 0. When xo — ô < x < xo, 
we can set x = xo + Ax and conclude that —ô < Ax < 0 and f'(x) < 0. 
It therefore follows from the last part of Theorem 5.28 that f has a local 
minimum at xo. In case f’’(x%o) < 0, everything is the same except that 
some signs are reversed and f has a local maximum at xo. 


Problems 5.39 


1 Sketch a graph of y = 1/x. Calculate dy/dx and d?y/dx?. If appro- 
priate connections between these things are not immediately clear, there are 
only three possibilities: (i) the graph needs repairs or (ii) the formulas for deriva- 
tives need repairs or (iii) the text of this section must be studied more carefully. 

2 The values of 


fe) = 13 


are certainly near 0 when x is near 0 and when |x| is large. Give a full account 
of the nature of the graph. 


3 Supposing that a, b, and ¢ are constants for which a > 0 and that 
f(x) = ax? + be +e, 
calculate f'(x) and f(x). Show that the only extremum of fis a minimum which 


is attained when x = —b/2a. Show that the graph of f is everywhere bending 
upward and that there are no flexpoints. 


4 Supposing that a, b, c, d are constants for which a > 0 and that 


f(x) = ax? + bx? + ex 4+. d, 
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calculate f'(x) and f”(x). Show that the graph of f has exactly one flexpoint 
for which x = —b/3a and that /’ is increasing when x > —b/3a. 

5 Show that if the x, y coordinate system is chosen in such a way that the 
graph of 


y= xi + be? + ex +d 
passes through the origin and has its flexpoint at the origin, then b = d = 0 and 
y = x + cx. 


Show that the graph of the latter equation has no extrema if ¢ = 0 and has two 
local extrema if ¢ < 0. 
6 Starting with the first of the relations 


(1) y=(a@—x2) Ta SF, 
differentiate twice and obtain the second relation. Then start with the first 
of the relations 
2 2 2 dy 
(2) wre x+y% =0 


(3) 1+ ys sz 


and show that differentiating with respect to x gives the others. Use (2) and 
(3) to obtain the second relation in (1). Tell why you should expect the sign of 
the second derivative to be opposite to the sign of y. 

7 Supposing that a and p are given positive numbers and considering posi- 
tive values of x and y, use the two methods of the preceding problem to find 
d?y/dx? when 


x? + yP = aP, 


Make the results agree with each other and, for the case p = 2, with a result of 
the preceding problem. Tell why the sign of the second derivative should (or 
should not) depend upon 9 as it does in your answer. 

8 Supposing that a > 0 and b > 0, show that the graph of 


f(x) = asin (bx + ¢) 


has a flexpoint wherever it intersects the x axis. 
9 Sketch a reasonably accurate graph of the function f for which 


f(x) = xsinx 


and observe that the graph seems to have flexpoints on or near the æ axis. 
Show that if (x,y) is a flexpoint, then tan x = 2/x and y = 2 cos x. Remark: 
These results show that if (x,y) is a flexpoint for which |x| is large, then tan x is 
near 0, sin x is near 0, cos x is near 1 or —1, and y is near 2 or —2. 

10 Supposing that n is 10 or 20, sketch the graph of y = sin" x over the 
interval O S$ x S$ m/2 and mark a point which seems to be a flexpoint. Then, 
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supposing that n > 1, show that the graph has a flexpoint at the point (x,,y,) 
for which 


— l . — (1 +)" d — 1\7/2 
COS a = 77 or SIn Xn = -3 an vn = (1 ~=) . 


Remark: Unless we know about the famous number e, it is still not easy to esti- 
mate yn when n is large. When we have learned the first of the formulas 


lim (1 + zy =e, lim ya = e7» = 0.606531, 
N— © n n= 0 
we will be able to put x = —1 and take square roots to obtain the second one. 


11 This problem, like the preceding one, involves ideas. Supposing that 
n > l and 


(1) y = x”(2 — x)” = (2x — x°)”, 
show that 

241 — 
(2) 2 = 2n(2n — 1)(2x — x2)? E — 2x + a — q 


and hence that the graph of (1) has a flexpoint at the point (%nYn) where 


1 1 P 
w= l pT =j -zal 


12 Determination of the natures of the graphs of equations like 
(1) yt = x? (1 + x?) 


is an ancient and honorable pastime. Observe that if the point (x,y) lies on the 
graph, then so also do the points (x, —y), (—x,y), and (—x,—y). If we find the 
part G of the graph in the first quadrant, we can therefore use symmetry to obtain 
the rest of the graph. Henceforth we consider only points on the graph for 
which x 2 0 and y 2 0. For these points, 


(2) y = (xt + x), 


To each x there corresponds exactly one y for which the point (x,y) is on G 
Moreover, y 2 (x4) = x, so G lies on or above the line having the equation 
y =x. Show that, when x > 0, 


dy 2x3 + x d*y  x?(2x?— 1) 


8) de TIEFE BAT FAA 


Show that the slope is decreasing over the interval 0 < x < 1/+/2, increasing 
over the interval x > 1/+/2, and attains the minimum value 4/625< 73g at the 
flexpoint having the coordinates 1/./2 and 4/34. Show that, when x > 0, 


4 0 < Wx4 2 a = “S as 
DOS NET an TREO a et a) © Oa) T a 
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and hence that the line having the equation y = x is an asymptote of G. The 
graph of (1) is shown on a small scale in Figure 5.391. 


Figure 5.391 


13 Determine the natures of the graphs of the equations 


(a) yt = x(1 + x?) (b) yt = x(x? — 1) 
(c) yt = x(x? — 1) (d) yt = x(1 — x) 
2 2. _* 
(e) y =F O Y= Typ 
1 , 1 
e) Y-GPVE—4 A) P= CTE # 


14 Sketch graphs of y = x, y = sin x, and then y = x + sin x. Then make 
repairs in the last graph that may be necessary to make it agree with formulas 
for the slope and the derivative of the slope. 

15 Persons interested in themselves and the surrounding world should not 
neglect opportunities to learn about the honorable Gauss (or normal) probability 
density function ® defined by the formula 


_(2—™M)? 
20? 


1 
l (x) = —=— 
(1) (x) Vine’ 
in which ø (sigma) and M are constants for which ø > 0. We should know that 
e° = 1, and we can cheerfully accept the facts that 


(2) e = 2.71828 ---, e% = 0.60653 + °°. 


We want to determine the manner in which the graph of y = (x) depends upon 
the constants M (which is called the mean of ©) and ø (which is called the stand- 


ard deviation of $). Show that 
_(z— M)? 
2g? 


(3) p(x) = Ep (x — Me 
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M-30 M-o M Mio M+3oe x 
Figure 5.392 


Tell why it is true that, as Figure 5.392 indicates, ® is increasing over the inter- 
val x S M, ® is decreasing over the interval x 2 M, ® has a maximum at M, 


and ®(M) = 1/V2r o. Show that 


1 _(2-M)? 
(4) B(x) = TI — Myr — ate 


Tell why it is true that, as the figure indicates, ®’(x) is increasing and the graph 
is bending upward when x < M —o and when x > M + ø, whereas ®’(x) is 
decreasing and the graph is bending downward when M—ox<x< M +o. 
Finally, show that, as the figure indicates, 


0.60653 
Vin 0 


Remark: The index will tell where this and other bits of information about Gauss 
probability functions are concealed. We shall learn that 


(5) (M — o) = (M + o) = = 0.60653% (M). 


(6) [2 Ee) a =1, 


and budding scientists are never too young to start hearing that, in appropriate 
circumstances, the number 


(7) f. 8) dx 
is taken to be the probability that a number x lies between a and b. 

16 Sketch rough graphs of y = cos x, y = 2 cos x, y = cos 2x, and then 
(1) f(x) = 2 cos x — cos 2x 


over the interval O £ x S r. Find the maxima and minima of f and the flex- 
points of its graph. Make the results agree. 
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17 Make a sketch showing the points (x,y) on the graph of the equations 
x = a(¢d — sin 9), y = a(l — cos ¢) 


for which @ = 0, 7/2, 7, 34/2, and 27. Show that the graph is convex downward 
at each point for which y #0. Remark: The graph is acycloid. Answers can 
be simplified by use of trigonometric identities. Thus 


d 
I3 = a(l — cos ġ) = 2a sin? $ 


ig = esing = 2a sin È cos $ 
$ 
dy _ sin $ SF 
dx 1l—cos@? , @ ©?’ 
sin > 


so dy/dx > 0 when 0 < @ < r and dy/dx < 0 when t < ¢ < 2r. Moreover, 
since 


d of- 427 l 
T a r r ~ 
sin > 2 sin? = 
when sin $ x 0, we find that 
d dy d $ 

dy _ dds dp"? 1 

3 = an =z Ā—— = _ — — 
an 2a sin $ 4a sint £ 


when sin $ Æ 0 and hence when y Æ 0. The slope is therefore decreasing when 


y £0. 

18 Verify that the hypotheses and conclusion of the following theorem are 
satisfied when f(x) = sin x, g(x) = (sin x)/x, and a = r. 

Theorem If a> 0, if f has two derivatives over the interval Ò S x S a, tf 
fO) 2 0, if f (x) < O when 0 < x < a, and if g(x) = 
}(x)/x, then g is decreasing over the interval O < x < a. 
Remark: This theorem has a very interesting geometric 
interpretation. The hypotheses imply that, as in Fig- 
ure 5.393, f(0) 2 O and the graph of f is convex down- 
ward. The graph can make us feel that, as x increases, 
the angle @ must decrease and hence f(x)/x must 
decrease because f(x)/x is tan 9. It is, however, neces- Figure 5.393 
sary to recognize that feelings and impressions obtained 
by looking at one or a dozen figures do not constitute a proof of the theorem. 
To prove the theorem, we begin by observing that, when 0 < x < a, 


_ fe) — fle) _ hex 


0) 


x x 
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where h(x) = xf’(x«) — f(x). We will know that g is decreasing over the interval 
0 < x < aif we can show that A(x) <Owhen0 <x <a. Since 


h'(x) = xf" (x) +F) — F) = xf"), 


our hypothesis that f”(x) <0 when 0 < x < a implies that h’(x) < 0 when 
0<x< a. Thus A(x) is decreasing over the interval 0 < x < a. Since h is 
continuous and A(0) = —f(0) < 0, it follows that h(x) < 0 when 0 < x <a, 
This gives the conclusion of the theorem. 

19 Ideas of this section and the preceding one can be used to obtain informa- 
tion about the graphs of the Bernoulli functions Bo(x), Bi(x), Be(x), - - > that 
appeared in Section 4.3, Problem 10. We recall that Bo(x) = 1, that 


(1) Bax) = Bn-1(*) 
(2) I, B,(x) dx =0 


when n = 1, 2, 3, - + + except that (1) fails to hold when n is 1 or 2 and x is 
an integer, and that all of the functions except B(x) are continuous. To keep 
our task within reasonable bounds, we suppose we know the fundamental fact 
that B,(0) = 0, Ba($) = 0, and B,(1) = O when n is odd, that is, when = 1, 
3, 5,7, ° + > . We want to show, without making tedious calculations, that the 
miniature graphs of B,(x), B(x), ---, Bs(x) over the interval OS x <1 
appearing in Figure 5.394 give correct information about the trends and the 


B,(x) 


B(x) B,(x) B,(x) 
w mo ae —-._, oo 
Bs(x) Be(x) B,(x) B,(x) 


Figure 5.394 


zeros of these functions. When (12.384) and related formulas have been studied, 
we will be able to see that scales on the vertical axes have been adjusted to make 
the graphs visible; it can be shown that |Ba(x)| < 4/(27)* when n > 1 and hence 
that numerical values cannot be estimated from the graphs in Figure 5.394. 


Supposing that 0 < x < 1, show that the formulas By(x) = 1 and ` B(x) dx = 


0 imply that Bi(~) = x — $ and hence that the graph of B,(x) is correct. Show 
that the formula B(x) = B(x) implies that B(x) is decreasing over the interval 
O < x < $ and is increasing over the interval 4 < x < 1. Show that this fact 


and the formula f i B(x) dx = 0 imply that B.(0) > 0 and Bo(s) < 0 and hence 


that B(x) has exactly two zeros between 0 and 1. Show that the formula 
Be (x) = Bi(x) implies that the graph of Bs(x) is bending downward over the 
interval 0 < x < $ and is bending upward over the interval $ < x <1. Show 
that the formula B3(x) = Bo(x) implies that B(x) is increasing over the first 
part of the interval 0 < x < 1, is decreasing over a central part, and is increasing 
over the remaining part. Tell why B(x) is increasing over the interval 0 < x < 
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4 and is decreasing over the interval 4 < x < 1, and why B,(x) has exactly two 
zeros between 0 and 1. Continue this investigation until general conclusions 
about the functions B,(x) and their graphs have been reached. 


5.4 Theorems about continuous and differentiable functions 
It is possible to look at Figure 5.42 and others more or less like it and claim 
that these figures provide experimental evidence supporting the follow- 
ing theorem of which we shall give a stronger version in Theorem 5.52. 

Theorem 5.41 If L is a chord joining two points on the graph of a 
differentiable function, then there must be at least one point on the graph at 
which the tangent 1s parallel to L. 

There are at least two reasons why this theorem is surprising. It is 
thoroughly important, and it is impossible to prove it without making use 
of some substantial mathematical machinery that has not yet appeared in 
this course. The source of the difficulty 
can be stated very crudely by saying that L 
Theorem 5.41 would be false if there were AZ 
“holes”? in the set of real numbers so that 
the graph of Figure 5.42 contains no points 
having x coordinates x, and x2. To prove 
Theorem 5.41, and for many other purposes, 
we need a property or postulate or axiom 7% Xi x2 b 
which guarantees that the set of real numbers Figure 5.42 
is complete. While several different equiva- 
lent axioms can be given, the following one involving a fundamental idea 
of Dedekind (1831-1916) is in some respects the most natural one to 
adopt. 

Axiom 5.43 (Dedekind) Let the set of real numbers be partitioned 
into two subsets A and B (see Figure 5.44) in such a way that (i) each real 


A B 
a S S 
Xi g x2 

Figure 5.44 


number 1s put into either A or B, (ii) each of A and B contains at least one 
real number, and (iii) if xı is in A and x: is in B, then xy < xe. Then 
there is a real number & (the partition number xi) which is either the 
greatest number in A or the least number in B. 

Once again we are in a position where we should know something about 
our present state and prospects for future development. To attain full 
comprehension of the Dedekind axiom, and the manner in which it is 
used to prove basic theorems of mathematical analysis, is not a short 
task. Experience shows that, except in unusual special circumstances, 
it is quite unreasonable to suppose that enough time is available in a first 
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course in calculus to explore these matters thoroughly. In order to 
understand the proofs, it is necessary to sketch and study illustrations of 
various kinds, and progress is very slow. Until considerable experience 
has been obtained, it is not easy to reproduce the proofs even after they 
have been completely understood. Students who peek at the proofs 
can be compared with children of jewelers who peek at the innards of 
watches. ‘They start accumulation of knowledge of reasons why things 
tick, and the overwhelming importance of getting started must be 
recognized by everyone who knows that we toddle and walk before we 
run. So far as this course is concerned, it is of primary importance to 
understand the axiom and theorems of this section and to cultivate the 
habit of formulating and using precise mathematical statements. 

We begin a campaign to learn something about continuous functions 
and differentiable functions and their graphs by proving the following 
theorem. 

Theorem 5.45 If f 1s continuous over an interval a S x L b, then f is 
bounded over the interval, that is, there 1s a constant M such that 


f(~)| < M (asx sb). 


Our proof will use the Dedekind axiom. Let x; be put in 4 if xı S a. 
Moreover, let xı be put in 4 if a < xı S b and there is a constant M, 
such that |f(x)| S$ Mı when a S x S xı. Let B contain all other num- 
bers. This determines a Dedekind partition, and we can let ¢ be the 
partition number. Itis easy to see thata S £ < b, but the remainder of 
the proof is more delicate. Since f has right-hand continuity at a, we 
can lete = 1 and choose a positive number 6 such that f(a) — 1 < f(x) £ 
f(a) +1 and hence |f(x)| S |f| +1 whenever aSxS5a+65. 
Hence f is bounded over the interval a S$ x S$ a+ ô, so a + ô belongs to 
Aandé2a+6> a. Our next step is toshowthat?=b. IfE <b, 
then we have a < $ < b as in Figure 5.451. Since f is continuous at &, 


A B 
x a -ò &€ +ô b x2 


Figure 5.451 


we can let e = 1 and choose a positive number ô such that a < £ — ô < 
€+6<0b as in Figure 5.451 and |f(x)| < |A| +1 when ¢-— ô < 
x*S&+6. But £ — ô belongs to 4, so there must be a constant Mı 
such that |f(x)| < Mı whena S x < &— 6. If we let M, be the greater 
of Mı and |f(é)| + 1, then |f@)| S Mewhena S x S +8. Therefore, 
£ + 6 must belong to 4 and we have a contradiction of the fact that the 
partition number must be either the greatest number in 4 or the least 
number in B. All this shows that £ = b, and we are almost finished. 
Let e = 1 and choose a positive number 6 such that a < b — ô < b and 
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lf(x)| < f() + lwhend—6 Sx <b. Since bd — ô belongs to A, there 
must be a constant M; such that |f(x)| S Mz whena Sx <b— ô. If 
we let M be the greater of M; and |f(d)| + 1, then |f(x)| < M when 
a<x<=b. This completes the proof of Theorem 5.45. 

To strengthen Theorem 5.45, and for other purposes, we need informa- 
tion about upper and lower bounds. A set S of numbers is said to have an 
upper bound M,if x £ Mı whenever x isinSandissaidtohavea least upper 
bound (l.u.b.) or supremum (sup) M if M 


. + OO 
is an upper bound and there is no upper m, m x MM «x 
bound Mı for which M, < M. Analo- Figure 5.453 


gously, S is said to have a lower bound m, 

if x 2 mı whenever x is in S and is said to have a greatest lower bound 
(g.l.b.) or infimum (inf) m if m is a lower bound and there is no lower 
bound mı for which mı > m. The inequality 


(5.452) msmsxSMEM, 


shows how these numbers must be related when x is in S, and Figure 
5.453 shows a way in which they are sometimes related. 

Theorem 5.46 If anonempty set S of numbers has an upper bound M,, 
then it has a least upper bound. Similarly, if a nonempty set S of numbers 
has a lower bound mı, then it has a greatest lower 


bound. A _B 
As Figure 5.461 indicates, we make a partition  %-1 $M 
of numbers by putting a number in B if it is an Figure 5.461 


upper bound of S and putting a number in 4 if it is 

not an upper bound of S. The set B contains M,, and if xo is a number in 
S, then 4 contains the number xo — 1. Let & be the partition number. 
Let x be a number in S. Then, for each positive number e, x S & + e. 
Hence x < £ and it follows that isan upper bound of S. Ifx’ < & then 
x’ is in 4 and hence x’ is not an upper bound of S. Therefore, & is the 
least upper bound of S. This completes the proof of the first part of the 
theorem. The second part is proved similarly. 

Theorem 5.47 If f is continuous over an interval a S x <S b, thenf(x) 
attains minimum and maximum values over the interval at points of the 
interval, that is, there exist numbers m, M, xı, and x2 such that a S xı S 
b,a S x Sb, and 


m = f(x1) S f(x) Sf(m2) = M 


whenevera S x sb. 

To prove the part of the theorem involving M, we use Theorem 5.45 to 
conclude that f must have an upper bound Mı. It follows from Theorem 
5.46 that the set of numbers f(x) for which a S x S b has a least upper 
bound which we now denote by M. Then f(x) < M when a S x £ b, 
and it remains to be shown that there is a number xz for which f(x) = M. 
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If f(a) = M or f(b) = M, we set x: = aorx, = b. Otherwise, assuming 
that f(a) < M and f(b) < M, we determine the required number x, 
by means of a Dedekind partition. Put a number x’ in 4 if x’ S a and 
also if a < x’ < b and for some e > O the interval a < x S x’ contains 
no point for which f(x) > M — e. Let B contain all other numbers, and 
observe that b isin B. Let x be the partition number of this partition. 
Clearly, f(x.) < M. If we assume that f(x) < M, say f(x) = M — e, 
where eo > 0, we can choose a positive number ô such that a < x — ô < 
xo + ô < band f(x) < M — eo/2 whenever xo ~ ô S x S x: + 5. The 
fact that x. — ô is in 4 will then enable us to draw the erroneous con- 
clusion that x. + ô is in 4. Therefore, f(x2) = M. This completes the 
proof of the part of Theorem 5.47 involving M. To prove the part of the 
theorem involving m, we can use an analogous argument. We can, 
alternatively and more easily, use the 
fact that —f(x) must have a maximum 
—m attained when x = x, and hence that 
f(x) must have a minimum m attained 
when x = xı. 

The following theorem is known as 
the intermedtiate-value theorem; Figure 
5.481 provides experimental evidence. 

Theorem 5.48 If f is continuous over an intervala S x S b andifk 
is a constant for which f(a) < k < f(b) or f(a) > k > f(b), then there exists 
at least one number £ for which a < £ < b and f(E) = k. 

Taking first the case in which f(a) < k < f(b), we prove the result with 
the aid of a Dedekind partition. Let x; be put in 4 if xı S a and also if 
aS xı S b and f(x) < k whenever a Sx S xı. Let x be put in B if 
xo 2 b and also if a < x. < b and the interval a S x S x contains a 
number x for which f(x) 2 k. Let & be the partition number. Since 
f(a) < k and f has right-hand continuity at a, we can let e = k — f(a) 
and choose a positive number ô such that | f(x) — f(a)| < e and hence 
f(x) <k when aSxSa+é6. Hence a+ ô belongs to 4 and §2 
a+é>a. A similar argument shows that ¢ < b. Therefore a < £ < 
b. If we suppose that f(£) > k, then we can choose a positive number ô 
such that a < £ — ô < £+ ô <b and f(x) >kwhen§—SiSxSE+ 
ô. This contradicts the fact that f(x) < k when a < x < £ — ô and 
f(x) > k for some x in the intervala < x < £+ ô. If we suppose that 
f( 2k, a similar argument leads to a contradiction. Therefore 
f(é) = k. A very similar proof covers the case in which f(a) > k > f(b) 
and Theorem 5.48 is proved. 

In ordinary circumstances we try to be too efficient to clutter our books 
and our memories with obvious corollaries and applications of our 
theorems, but one corollary of the intermediate-value theorem is so 
important that we relax to look at it. Jf a function f is negative at x 


Figure 5.481 
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and is positive at x2, and if f is continuous over the closed interval having 
end points at xı and Xe, then there must be at least one x3 between xı and xs 
for which f(xs) = 0. This implies that the graph of a continuous function 
f cannot run from a point (%1,71) below the x axis to a point (x2,y2) above 
the x axis without intersecting the x axis at a point (x30) for which x 
lies between x, and x2. Instead of asking whether this result is ‘“‘obvious,” 
we can ask whether it is obvious that a man cannot walk from the Capitol 
of South Dakota to the Capitol of North Dakota without stepping upon 
the common boundary of the two Dakotas. 


Problems 5.49 


1 With the text of this section out of sight, try to produce adequate responses 
to the following orders; if unsuccessful, study the text again and try again. 


(a) Write a full statement of the Dedekind axiom. 

(b) Write a theorem which gives precise information about boundedness of con- 
tinuous functions. 

(c) Write a theorem which gives precise information about extrema of con- 
tinuous functions. 

(d) Write a full statement of the intermediate-value theorem. 


2 Using known properties of the function f for which f(x) = x2, show how 
the intermediate-value theorem (Theorem 5.48) can be used to prove that there 
is a positive number & for which £2 = 2. Give all of the details, recognizing 
that Theorem 5.48 cannot be applied until appropriate values of a and b have 
been captured. 

3 Modify the work of the preceding problem to prove that there must be 
at least one x for which f(x) = 0 when 


(a) f(x) =x? —7 (b) f(x) =x? —x -—7 

©) =~p~- 4 (4) f(x) = æ — cos x 
4 Letting 

(1) f(x) = LT txt x?t x3 + x4, 


determine whether there are any numbers x for which f(x) <0. Hint: Use the 
fact that f(1) = 5 and 

x 1 
(2) f(x) = 


5 — 
x— i 


(x = 1). 


Show that x5 = 1 only when x = 1, so f(x) is never zero and hence f(x) is never 
negative. 

5 Let f be defined over the closed interval ~1 S x S 1 by the formulas 
f(0) = 0 and f(x) = 1/x? when —1 Sx £ l andx #0. Show that there is no 
constant M such that |f(x)| S M when —1 Sx S1. Solution: Suppose, intend- 
ing to establish a contradiction of the supposition, that there is a number M 
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for which |f(x)| S M when -1 Sx S1. Letx =1/-V|M|/ +2. Thenx #0 
and —1 < x < 1, but f(x) = 1/x? = |M| + 2, so |f(x)| > [M| = M. 

6 Read Theorem 5.45. Then construct a figure which illustrates the mean- 
ing of the following remark. Ifa < band M > 0, then condition 


(1) [f(x)| < M (aSsxsė) 


is satisfied if and only if f is defined over the intervala S x S b and the graph of 
y = f(x) over the interval a S x S b lies between the graphs of the lines having 
the equations y = ~M and y = M. Note that this gives a “geometrical mean- 
ing’ to Theorem 5.45. Note that the inequality (1) holds if and only if M > 0 
and —M < f(x) < M whena Sx <b. 

7 Sketch a graph of the function f for which f(0) = 0 and f(x) = 1/x when 
x #0 and —1 Sx <1. Show that there is no M such that the graph of f 
over the interval —1 S x < 1 lies entirely above the line having the equation 
y = —M. 

8 Give an example of a function which has an upper bound over the interval 
—1 £ x < 1 but has no lower bound. 

9 Show that the function f for which f(x) = x has upper and lower bounds 
over the open interval 0 < x < 1 but possesses neither a maximum nor a mini- 
mum over this interval. 

10 Show that the function f defined over the closed interval O S x S 2 by 
the formula 


Fæ) =x — [x], 


in which [x] denotes the greatest integer less than or equal to x, has an upper 
bound but does not have a maximum. 
11 Prove that there is a number x* for which a <x < b and f’(x*) = 


LE) — f(a)]/(b — a) when 


(a) fx) = x°, a =0,b=1 
(b) f(x) = x8 — 7x? + 3x + 40, a = —1,b = 1 


12 Without undertaking extensive calculations that are easily made when 
appropriate computing equipment is available, we call attention to the Newton 
(1642-1727) method by which zeros of reasonable 
functions are approximated in decimal form. 
The method is based upon the elementary obser- 
vation that, in many cases more or less like the 
one shown in Figure 5.491, if x„ (where n may 
initially be 1) is a reasonably good approximation 
to a number z for which f(z) = 0, then the tangent 

Xn+l to the graph of y = f(x) at the point (xn, f(%n)) 

Figure 5.491 will intersect the x axis at a point (%n41,0) for 

which #a;41 is a much better approximation to zZ. 

The Newton method is normally applied in cases where f has many continuous 

derivatives and f’(x) # 0 when x is near z but x Æ z. In such cases the equation 
of the tangent at (xq, f(%n)) is 


Y — Fn) = f'n) (x — Xn) 
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and setting y = 0 gives 


f(%n) . 


ntl = Xn — F (Xn) 


When the method is applied, we start with a first approximation x, set n = 1 
to calculate a second approximation xz, set n = 2 to calculate a third approxima- 
tion x3, and soon. To test the Newton method and understanding of it in situa- 
tions where computations are not too onerous to be done with a pencil, calculate 
x when 


(a) f(x) = x? — 2, xı = 1.4 
(b) f(x) = x3 — 20, x, = 3 
(c) f(x) = xt + x — 20, x, = 2 
(d f(x) = x4 + 5x — 50, x, = 2 


13 It is not always easy to tell what is obvious and what is not, the funda- 
mental difficulty being that some things that have been thought to be “‘obviously 
true” are false. Consider, for example, the “obvious”? statement that “each 
finite set of numbers contains a greatest element.” If, as is usual, the empty 
set is considered to be a finite set, the statement is false. Consider, then, the 
revised statement “each nonempty finite set S of numbers contains a greatest 
element.” Is this obviously true? Let n be a positive integer and let the num- 
bers be x1, %2, %3, °° * ,%n- The only thing we know about these things is that 
they are numbers. One possible method of attacking the problem starts with 
a comparison of x; and x2 If; S xz we discard xı and consider the remaining 
set, butif xz < xı, we discard x2. Instead of employing this “‘finite mathematics,” 
we introduce some “infinite mathematics” that will make us think about least 
upper bounds. The fact that x; S |x;| for each k implies that 


xe S for] + [xe] + - - - + [xl 


for each k. Hence the set S has an upper bound, and it follows from Theorem 
5.46 that the set has a least upper bound M. If there is a k for which x, = M, 
then this x, must be a (or perhaps the) greatest element of S. If we suppose 
that there is no & for which x, = M, and hence that x, < M for each k, we can 
obtain a contradiction of the hypothesis that S contains only n elements. Todo 
this, let yı be an element of S. Then yı < M and there must be an element y2 
of S for which yı < y2 < M. The same argument shows that there must be an 
element ys of S for which ya < ys < M, and soon. We run into a contradiction 
of the assumption after we have used n elements of S. This proves that the set 
S does contain a greatest element and provides the possibility that schemes for 
finding “it”? might even work. 

13a Remark: To put the following problems and their consequences upon 
a rigorous base, we should have a definition of the set S; of positive integers. 
This set S, can be defined to be the subset of the set S of positive real numbers 
for which the number 1 is the least element in Sı; a + b is in Sı whenever a and 
b are in Sı; b — a is in Sı whenever a and b are in S and a < b. It follows from 
this that if a and X are numbers for which O < A < 1, then the intervala S x S 
a + à can contain at most one integer. 
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14 Prove the Archimedes property of numbers: if e > 0 and a > 0, then there 
is an integer n for which ne > a. Solution: Suppose ne S a for each n. Then 
the set S of numbers e, 2e, 3€, - - -> is nonempty and has an upper bound. Hence 
S must have a least upper bound M. There must be an integer m for which 
me > M — e Then (m+ 1)e > M, and hence M is not an upper bound of S. 
This contradiction proves that there is an n for which ne > a. 

15 Prove that each nonempty set of positive integers contains a least element. 
Solution: Let S be a set of positive integers. Since S is nonempty and has the 
lower bound 1, S must have a greatest lower bound m. Let0O < e< $. ‘The 
interval m S x < m + e must contain an integer n in S, since otherwise m + e 
would be a lower bound greater than m. Since the interval has length less than 
$ and cannot contain two integers, it follows that n is the one and only integer 
in S which is less than m + e. Therefore, n is the least integer in S. Remark: 
The fact that each nonempty set of positive integers contains a least element 
will now be used to prove the following principle of mathematical induction. If 
a particular assertion involving a positive integer k is true when k = 1, and if the 
assertion is true when k = n+ 1 provided n 2 1 and it is true when k = n, then 
the assertion is true for each positive integer. Let T be the set of positive integers 
for which the assertion is true, and let F be the set for which the assertion is 
false. If Fis nonempty, then F must have a least element m which is a positive 
integer greater than 1. Then m — 1 must be in T and our hypothesis gives the 
conclusion that mis in T. Thus m is in both F and T and (on the basis of the 
tacit assumption that we are dealing with statements that are either true or 
false but not both) we have a contradiction that proves that Fis empty. There- 
fore, T contains each positive integer and the assertion is therefore true for each 
positive integer. 

16 Prove that if x is a number, then there is an integer n for which n S x < 
n+1. Remark: This property of numbers was mentioned in Section 1.1, and 
the integer n is [x], the greatest integer in x. Solution: Suppose first that x > 2. 
Using the Archimedes property of real numbers (Problem 14) with e = 1 and 
a = x shows that the set S of integers greater than x is a nonempty set of positive 
integers. Hence, as Problem 15 shows, S must have a least element m. Then 
x < m, but the inequality x < m — 1 must be false because otherwise m would 
not be the least element of S. Therefore, m — 1 £ x < m and we obtain our 
result by setting n = m — 1. In case x S$ 2, we can choose an integer k such 
that x+ k > 2. Letting m be an integer for which m Sx + k < m + 1, we 
find that m — k is an integer n for whichn Sx <n +1. 


17 Prove that if x is a number and 7 is a positive integer, then there is an 
integer k for which 


bo kt) 
n n 


Remark: In case m is a nonnegative integer and n = 10”, the result shows how 


x is related to “finite decimals.” Proof: Problem 16 shows that there is an 
integer k for which 


kRkSnx<kt+l, 


and the result is obtained by dividing by n. 
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18 Prove that if x is a number, then there exist an integer N and a sequence 
dy, do, dz,‘ * > Of digits (a digit being one of the integers 0, 1, 2, - - - , 9) such 
that 


dı d, ee dn—1 dn 
(1) N + i0 tint + iommi t jon = * 
dı dy . dn—1 dn + 1 
<N +i + fot t om t o 
for each n = 1, 2, 3,- . Solution: Let N = [x], so that N is the greatest 
integer in x, and let 0 =x — N. Then 0 S 0 < 1 and the required result will 
follow if we prove that there exist digits dı, d2, da, * © > such that 
hd d.. d, d 
a deg da htl 
a (ae (C(O os t To 


for each n = 1, 2, 3, - °°. To prove (2), it is sufficient to prove that 


dy ds do l 
< mae —— er == — ° ° e — mno ~ 
(3) 059-10 — 70: To < 10" 


While it is of interest to take time to use (2) to determine what dj, d2, ds, © © ° 


must be if they exist, we save time by defining integers dı, dz, da, * > > by the 
formulas 
(4) dı = [106] 
dy 
= 2(9 —— 
6) a = [10 (9 - 75) | 
— d d 
(6) da = | 10° (0 — 7 — =.) | 
and, in general, for each m = 1, 2,3, °°: 
dı do dn | 
= ntl [ 9 a — n =L æ e.s. — “n . 
”) dmi = [10° (0 — 99 ~ 793 To) 


Since 0 < 6 < 1, we find that O <£ 100 < 10 and hence that dı is a digit. More- 
over, 


(8) di £1090 <di+1 
and hence 

dı 1 
(9) OS 6- IO < TO 


so (3) holds when n = 1. Multiplying (9) by 102 gives 


(10) 0 < 10? (0 — 3) < 10. 
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Hence (5) shows that dz is a digit and 


(11) a <10(6-2)<a tl. 
Dividing by 10? and transposing give 
d dy 1 
(12) 0<6- 10 ~ 102 < io® 
so (3) holds when n = 2. This procedure enables us to prove (3) by induction. 
If dı, do, © © © d, are digits and (3) holds, then 
d 

(13) o<i0on(e-2 aan ~ &) < 10 
and (7) shows that dn, is a digit and 

14 dour S10 (9-H eH a) < dant] 
( ) 102 10” n+l ° 


Dividing by 10”+! and transposing give the result of replacing n by n + 1 in (3). 
This proves (3) by induction, that is, by use of the principle of mathematical 
induction of Problem 15. 

19 Let F(6) be the temperature or pressure at the place P where a circle 
having its center at the origin of an x,y coordinate system is intersected by the 
ray from the origin which makes the angle 6 (as in trigonometry) with the positive 
x axis. It is supposed that F is continuous and F(@ + 2r) = F(0) for each 6. 
Prove that there are two diametrically opposite points of the circle at which F 
has equal values. Hint: Apply the intermediate-value theorem to the function 
f for which f(@) = F(@) — F(@ + r). Observe that if f(@o) > 0, then f(@> + T) = 
—f(60) <0. Remark: While we do not yet have equipment required for proof, 
we can learn an interesting property of continuous functions defined over sur- 
faces like spheres. There are two antipodal (or diametrically opposite) places 
on the surface of the earth having both equal temperatures and equal atmos- 
pheric pressures. 

20 Suppose that a world has existed so long and so favorably to fish that an 
infinite number of fish have existed but that only a finite number of fish have 
existed at any one time because the world contains only a finite number of atoms. 
Prove that there is a least number mp such that the mass m (measured in some 
standard system) of each past and present and future fish is less than or equal to 
Mo. 

21 Itiseasy to presume that if fis differentiable over the interval —1 Sx <1 
and if f’(0) = 1, then there must be a positive number A such that f is increasing 
over the interval —h S x S h. Use the function f for which (0) = 0 and 


f(x) =x +a sinh (x ¥ 0) 


to show that the presumption is false. Hint: Show that f’(0) = 1 and that, 
when x + 0, 


, . 1 2 
f'(x) = 1 + 2x sin z3 — z cos z 


x? 
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Observe that if n is a positive integer and x, = 1/+/2nz, then 


f'(x) = 1 —2 V2nr. 


It follows that each intervalO < x < h contains subintervals over which f'(x) < 0 
and f is decreasing. 

22 Boom-and-bust processes occur (or seem to occur) in economic and political 
life. Persons who get their political information from clever press secretaries 
of astute chiefs of state discover that the fortunes of their countries are at low 
ebbs when new chiefs are installed and that these fortunes steadily improve 
during the tenure of each chief. Such processes occur in electrical engineering 
when a charge on a capacitor steadily increases until a spark jumps and the charge 
disappears. This problem involves a particular boom-and-bust process in which 
a and gq are positive constants. It is supposed that, for each integer n, the 
quantity y is 0 when ¢ = na and that y increases at a constant rate over the inter- 
val na St < (n + 1)ain such a way that y approachesg as ¢ approaches (n + 1)a 
from the left. Sketch a graph of y versus ¢ and find a formula giving y in terms 


of zt. Partial ans: 
Y= 4\a7 Lal)’ 


where [x] denotes the greatest integer in x. 

23 While persons confining their mathematical contacts to modern mathe- 
matics books need not worry about the matter, others may need a warning. 
In the good old days, the word “finite” was used in place of the word “bounded.” 
In order to understand assertions involving the word finite, it is sometimes not 
sufficient to understand modern mathematics. Sometimes we need substantial 
information about history, and sometimes we need conscious recognition of the 
fact that assertions involving the word “finite” have different meanings at different 
places and at different times. For example, the assertion that “f is finite at xo 
can mean that there is an interval with center at xo such that f is bounded over the 
interval. The assertion can, however, have other meanings, and this is the 
reason why we should shudder when we hear it. 

24 A function f is said to have a generalized first derivative Gf’(x) at x if 


(1) Gf'(®) = lim feth Aeh 


and is said to have a generalized second derivative Gf” (x) at x if 


(2) Gf” (æ) = lim 1 — h) — ahs) + f(x + h). 


Prove that Gf’(x) = f'(x) when f'(x) exists. Hint: Use the fact that 


fe th fenh 1sf~eth—f@ sh o a, 
(3) 7h =3| h + -h ] 


Remark: The wide world contains several persons who have sharpened their wits 
by trying to answer two questions which are not guaranteed to be easily answered. 
Does the hypothesis that Gf’(x) = 0 when a < x < b imply that there is a 
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constant ¢ for which f(x) =c when a<x <b? Does the hypothesis that 
Gf” (x) = 0 when a < x < b imply that there are constants cı and ce for which 
f(x) = cix +ceewhena<x <b? 


5.5 The Rolle theorem and the mean-value theorem In this 
section we prove some fundamental theorems and use them to review and 
prove some theorems that have been previously given without proof. 
The following theorem must be permanently remembered and known as 
the Rolle (Michel, 1652-1719) theorem. It is not to be presumed that 
Rolle proved or even knew this theorem, but he did discover some of its 
applications to polynomials. 

Theorem 5.51 (Rolle theorem) If a< b, if f ts continuous over 
a Sx Sb, if f is differentiable over a < x < b, and if f(a) = f(b) = 0, 
then there is at least one number x* for which a < x* < b and f'(x*) = 0, 

The proof of this theorem is mildly tricky because it seems to be 
necessary to consider three different cases. Suppose first that f(x) = 0 
over the whole interval a S x <S b. Then f’(x) =O when a<x<b 
and we can choose x* to be any number between a and b. Suppose next 
that there is a number x, for which a < x, < b and f(xı) > 0. Then 
with the aid of Theorem 5.47 we see that f must attain a positive maxi- 
mum f(x*) at some point x* for which a S x* < b, and we can be sure 
that a < x* < b because f(a) = f(b) = 0. Since f'(x*) exists, it follows 
from Theorem 5.26 that f’(x*) = 0. Suppose finally that there is a 
number xz for which a < x < b and f(x2) < 0. Arguments similar to 
those used above then show that f must have a negative minimum at 
some point x* for which a < x* < b and that f’(«*) = 0. 

The following theorem is known as the law of the mean or the mean- 
value theorem of the derivative calculus. It is a strengthened version of 
Theorem 5.41, which we have discussed briefly, the right member of 
(5.53) being the slope of the chord joining the points (a, f(a)) and (b, 
f(b)). It, like the Rolle theorem, must be permanently remembered. 

Theorem 5.52 (mean-value theorem) If a < b, if f is continuous 
over asx Sb, and 1f f is differentiable over a < x < b, then there 1s at 
least one number x* for which a < x* < b and 


or 
(5.54) fO) — fla) = f°") — a). 


This theorem differs from the Rolle theorem because it is not assumed 
that f(a) = f(b) = 0. It happens, however, that the theorem can be 
proved by applying the Rolle theorem to the function @ defined by 
o(x) = f(x) — g(x), where g is the function whose graph is the chord 
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f(b) 


a x* b x 


Figure 5.55 


joining the points (a, f(a)) and (b, f(b)) of Figure 5.55. The point-slope 
form of the equation of a line gives the formula 


(5.561) g(x) — fla) -AA ( a) 


in which g(x) appears instead of the more familiar y. Hence, 


when a S x S$ b and 
(5.563) o'(x) = f(x) — w ~ He) — Ke) 


whena <x <b. Itis easily seen that ¢ satisfies the hypotheses of the 
Rolle theorem. Choosing x* such that a < x* < band ¢’(x*) = 0 gives 
the required conclusion (5.53). Multiplying by (6 — a) then gives 
(5.54), and Theorem 5.52 is proved. 

Theorem 5.57 Iff ts continuous over a S x < b and f'(x) = O when 
a<x <b, then f(x) = f(a) whena S x sb. 

To prove this theorem, we note first that f(x) = f(a) when x =a. If 
a<%*1 Sb, we can apply the mean-value theorem to the interval 
a Š x S x; to conclude that there is a number x* for which a < x* < x 
and 


f(s) — f(a) = f'(%*) (a1 — @). 


But f’(x*) = 0 and hence f(xı) = f(a). Therefore, f(x) = f(a) when 
a Š x S band Theorem 5.57 is proved. It follows from this theorem that 
if two functions F; and F, have the same derivative over an interval, say 


F(x) = F(x) = g(x) when a < x < b, and we put f(x) = F(x) — Fi(x), 
then f'(x) = 0 when a < x < b so f(x) must be a constant ¢ and 


F(x) = F(x) -+ C 


when a S x <b. This proves Theorem 4.13. 
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We now prove Theorem 5.27 the first part of which says that if f is 
continuous over an interval a) S x S do and f'(x) > Owhen a < x < bo, 
then f is increasing over the interval ao Sx S bo. Leta S xi < x: S 
bo. The mean-value theorem guarantees existence of a number x* such 
that xı < x* < x: and 


f(x) — f(x) = F (%*)(%2 — x1). 
But f'(x*) > 0 and (x2 — xı) > 0, so f(x) — f(xı) > 0. Thus f(x) > 


f(x:) and f is increasing. For the second part of Theorem 5.27, every- 
thing is the same except that f’(x*) < 0 and f is decreasing. 

The following theorem expresses the fact that if a function f is con- 
tinuous over a closed interval a S$ x S b, then it is uniformly continuous 
over the interval. 

Theorem 5.58 If f 1s continuous over a closed interval a S x S b, then 
to each e > O there corresponds a ô > O such that 


|f) — f(x)| < € 


whenever a S xı S b,a S xz S b, and |x — x| < ô. 

While neater proofs of this theorem can be given in advanced calculus 
after more mathematics has been digested, there is virtue in knowing that 
it is possible to base a proof upon a straightforward application of the 
Dedekind axiom 5.43. The bookkeeping by which we inch along toward 
the answer is really very elementary, and students who have the patience 
to see that this is so are very likely to become the leading scientists of the 
future. Let e be a given positive number. Let a number x be placed in 
the set 4 if x S a and also if a < x < b and there is a positive number 6 
such that |f(xe) — f(xı)| < e whenever a S xı S x, a S xz S x, and 
Ixo — xı| < 6. Let B contain each number x not placed in 4. Let £ be 
the partition number. Clearly, a S £ < b. Since f is continuous at a, 
we can choose a positive number 6, such that |f(x) — f(a)| < e/2 when- 
evera Sx <S<a-t ôn Then 


FE) — fedl = e) — f(@] e) — al 
< |f) — f@| + Ife) -Altas 


whenever a S xı S a + ôa S x2 S 61, and |x2 — x,| S 5). Therefore, 
a + ô, belongs to 4 and it follows that £ > a. Our next step is to prove 
that =b by obtaining a contradiction from the assumption that 
a< <b. Suppose, then, thata < < b. Since fis continuous at ¢, 
we can choose a positive number ôz such that a < £ — ô: < E + z <b 
and |f(x) — F(E)| < e/2 whenever |x — | < ô Moreover, since £ — êz 
must belong to 4, we can choose a positive number ô; such that 63 < 8 
and |f(xe) — f(xı)| < e/2 whenever a S xı §—-—6&, a S x: S & — by, 
and |xe — xı| < 63. Now suppose that a S xı < E + ôn a L x2 SEH 


5.5 The Rolle theorem and the mean-value theorem 327 


So, and |x2 — xı| < ôs. We may suppose that xı S xo. If xı and xo 
both lie in the interval |x — | S 8, then |f(x2) — f(x:)| < e If xı and 
x2 both lie in the interval a S x S & — 8, then again |f(x2) — f(x1)| < €. 
Ifa S xı S — db: and & — ô: < x2 S E+ ô then 


| f(a) — F| S (flee) — f(E Sal + [f(E — 82) — f(x] < 5+ 5 = e. 


Thus, in each case, |f(x2) — f(x1)| < e, so £ + ôzis in 4 and £ cannot be 
the partition number. This contradiction shows that £ = b, but to 
complete the proof of the theorem, we must show that b is in the set A. 
Since f has left-hand continuity at b, we can choose 8’ > O such that 
a<b—8 <b and |f(x) — f(b)| < e/2 whenever b- 8’ <x <b. 
Since b — 6’ must belong to 4, we can choose a positive number 6 such 
that ô < 6’ and |f(x2) — f(xı)| < ¢/2 whenever aSuiSb—S8, as 
xo = b— 8’, and |x, — xı| < 6. Arguments used above show that 
|f(x2) — f(xı)| < e whenever a S xı < b, a S x S b, and |x: — xı| < ô. 
This completes the proof of Theorem 5.58. 


Problems 5.59 


1 With the text out of sight, write completely accurate statements of (a) 
the Rolle theorem, (b) the mean-value theorem. 

2 Sketch several graphs that seem to be graphs of functions satisfying the 
hypotheses of the Rolle theorem, and see whether it seems to be true that the 
star points exist. 

3 Sketch several graphs that seem to be graphs of functions which, for one 
reason or another, do not satisfy the hypotheses of the Rolle theorem but never- 
theless it seems to be true that star points exist anyway. 

4 Sketch several graphs that seem to be graphs of functions which, for one 
reason or another, do not satisfy the hypotheses of the Rolle theorem and star 
points do not exist. 

5 Tell why there is no request for construction of graphs of functions that 
satisfy the hypotheses of the Rolle theorem and star points do not exist. 

6 Prove that if F’(x) > 0 when a < x < b, then there is at most one x 
for which a < x < band F(x) =0. Solution: If we suppose that a < xı < x: < 
b and F(x) = F(x2) = 0, an application of the Rolle theorem yields the con- 
clusion that there is a number & for which x; < £ < x and F’(£) = 0. This 
contradicts the hypotheses and the result follows. 

7 This problem requires us to review fundamental processes of the calculus 
whose validity depends upon Theorem 5.57. Supposing that f and g are func- 
tions defined and continuous over an interval containing the point x = a and 
that 


(1) f@) = g(x), fa = 4, 
we can then write 


(2) f(x) = Sex) dx + c, 
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where {g(x) dx stands for some particular function whose derivative is g(x), and 
then so determine c that f(a) = 4. We can also determine f from the formula 


(3) fle) =f) + fre a 


in which the integral is a Riemann integral. Determine f in two different (or 
superficially different) ways by using (2) and by using (3), and make the results 
agree, when 


(a) f'(x) = 2x, f(2) = 3 (b) f'(x) = sin ax, f(0) = 0 
(c) f'(x) = cos ax, f(0) = 0 (d) f'(x) = e7, f(0) =1 
(6) @) = + f(2) = 3 H e) = Va f(a) = 0 


8 Prove that if « and v are functions that have continuous derivatives over 
an interval J containing a and x, then 


[eee & = ao E- [Poe a. 


Hint: Let F(x) and F(x) denote the left and right sides of the formula. Then 
show that F,(a) = F(a) and that Fy(x) = F3(x) when x is in J. 
9 If f() =O and 


i x27 
FE) = 7a 


the result of writing formula (2) of Problem 7 as an aid to finding f(1) is rather 
(or more) futile, but we can write a version of (3) and undertake to estimate 
Ja). Do it. 

10 Sketch a graph over the interval 0 S x S 1 of the function f for which 
f(x) = x? Lete = %. Use your eyes to select a ô > 0 such that | f(x) — f(«1)| 
< e whenever 0 S$ x; <S 1,0 S x: S 1, and |x2 — xı| < 6. Note that if this f 
were the only continuous function, we would not need to work so long to prove 
Theorem 5.58. 

11 Supposing that f has a continuous derivative over the intervala Sx < b, 
show that the functions F and G for which 


Fa) =f +4 [704+ P04 Ole 
Ge) =4 fn -fO+ rola 
are both increasing over the interval a S x S$ b and 


fœ) = F(x) — G(x). 


Remark: This problem contains an important idea. It is sometimes useful to 
know about the possibility of representing a given function as the difference of 
two increasing functions. 

12 Prove the following theorem, which is known as an extended (not gen- 
eralized) mean-value theorem or as a Taylor theorem. 
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(1) Theorem [f f is such that f" exists over an interval containing a and x, 
then there is at least one number x* between a and x such that 


(2) ga) = fla) +O @ a 42 @O & — 2 
Solution: Let 
(3) 6) =f) — 10 — FP @~9- Se -m 


where C is a constant chosen such that ¢(2) = 0. Then ¢(a) = ¢(x) = 0 and 
the other hypotheses of the Rolle theorem are satisfied. The Rolle theorem 
therefore furnishes a number x* between a and x for which @’(x*) = 0. Thus, 


(4) p (%*) = =F E +H *) — a*a — x*) + Ce — x*) = 0. 


Therefore, C = f’’(x*). Since (a2) = 0, we can put ¢ = a in (3), equate the 
result to 0, and transpose to obtain the required formula (2). Remark: With the 
additional hypothesis that the second derivative f” is continuous, we shall use 
integration by parts in Section 12.5 to obtain more straightforward derivations 
of (2) and related formulas. 

13 This problem requires attainment of understanding of matters relating to 
the following generalized mean-value theorem which involves two functions. 

(1) Theorem Let f and g be continuous over the closed interval from a to x, 
let f and g be differentiable over the open interval from a to x, and let the derivative g’ 
be different from zero over the open interval from ato x. Then there exists an x* 
ın the open interval from a to x such that 


Iœ) — fla) _ f” 
g(x) — g(a) a’ (x*) 
We assume that f and g are given functions satisfying the hypotheses of the 


theorem. Two applications of the mean-value theorem then show that there 
exist numbers x; and xg between a and x such that 


(2) 


f(x) — F(a) 
(3) fle) = f@) _ ema P, 
g(x) — g(a) g(x) — g(a)  g'(x3) 
x— a 


While this result can be useful, it is crude because x; and x3 are not necessarily 
equal. We obtain the more useful and elegant result (2) by arranging our work 
to make a single application of the Rolle theorem. The trick is to define a new 
function @ by the formula 


(4) pO = F) — f@lle® — e(@] — YO — f@lle@) — gla). 


It is easy to see that (a) = (x) = 0 and that the other hypotheses of the Rolle 
theorem are satisfied. Hence the Rolle theorem implies that there is a number 
x* between a and x for which 


(5) ¢'(t*) = æ) — f@le’@*) — fee) — ela) = 9, 
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and the desired formula (2) follows from this. The generalized mean-value 
theorem has numerous applications, including the following one. Since x* lies 
between a and x, (2) implies that 


fn £0) ~L) = in LO) 


(6) za g(t) = gla) aag (2) 


provided the limit on the right exists. 
For the special case in which f(a) = g(a) = 0, (6) reduces to the famous and 
useful L’ Hôpital formula 


tm £2) = im LO) 
”) pas g(x) mre gG) 


which, like (6), is correct when the limit on the right exists.{ In this case the 
quotient f(x)/g(x) is said to be “‘an indeterminate form of the form 0/0 when 
x =a.’ The formula (7), which gives a method for finding limits of “‘indeter- 
minate forms,” is called “the L’Hépital rule for evaluation of indeterminate 
forms.” Stories about “‘evaluation of indeterminate forms” will not injure us 
if we resolutely remember that we sometimes find limits but that we never 
evaluate 0/0. When we apply the L’Hépital formula (7), we must not fall asleep 
at the switch and write the derivative of the quotient f(x)/g(x); we write the 
quotient of the derivatives. The following rather simple examples show how 
the formula is applied: 


. x?—1 2x 
® inating to? 
(9) lim 2% = lim S =] 
z—>0 «x z—0 1 
(10) lim LZ CSF _ lim BX = 0 
z—0 x z—0 1 
(11) lim Lo OOS * = lim SIN-* = Jim COS* _ 1 
20 x? 20 2x z—=0 2 2 
(12) lim ZZ! = lim Č =1 
z—0 x z0 1 
. e—-1l—x . e=—1 . g 1 
13 lim {& > = = eat 
a3) lim = lim = ling = 5 
(14) lim (— 1) = lim = 8B * 
z=»0 \sinx x e330 xsinx 
= lim 1 — cos x = lim sin x _%_o 
z0 x cosx + sinx pxo2cosx—xsinx 2 ` 


14 Show that 
lim ntl — (nt Iet _ naat). 
2 


z—>1 (x — 1)? 7 


t Guillaume Francois de |’Hospital (1661-1704) introduced this formula in a book 
Analyse des infiniment petits pour intelligence des lignes courbes (Paris, 1696, 182 pp.) which 
enjoyed widespread popularity. While there can be objections to tinkering with names of 
people, most authorities insist that we must accept evolution from |’Hospital to L’Hépital 
quite as cheerfully as we accept evolution from hostel to hotel. Even counterrevolutionists 
must recognize that the name is spelled in different ways. 
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15 Supposing that n is a positive integer and x 1, differentiate 


2 3 e à „2t -l 
(1) Pte tx?+ x? t + et = 
to obtain 
„1 _ nx"H — (n +1)” +1 
(2) ee ooo Hna a . 


Multiply by x and differentiate again to obtain 


(3) 12 4+ 22% + 3%? + + + + + nxn 
_ meant? — (2n? + 2n — l)et + (n +1 — x — 1 
(x — 1)? 


Finally, take limits as x — 1 to obtain the formula 


Remark: We could multiply (3) by x and differentiate and continue the process 
to obtain formulas for sums of cubes and higher powers. The details mushroom 
rapidly as we proceed. 

16 Another L’Hépital rule is embodied in the following theorem. 

(1) Theorem If f and g are differentiable over the infinite interval x 2 xo 
and uf 


(2) lim f(x) = ©, lim g(x) = œ, 
then 


zo g(x) so g'(x) 


provided the limit on the right exists. 


To prove this theorem, suppose that the right member of (3) exists and is L. 
Let 


(4) (x) = f(x) — Lex). 

Then 

(5) lim 2% = lim L) = Le) _ 9 
aoe g'(x) ge g'(x) l 


Let e > 0. Choose x; such that xı > xo, g(x*) > 0 when x* 2 xı, and 


It then follows from the generalized mean-value theorem of Problem 13 that 


£ 
2 


(x* Z xı). 


lx) — o| E x > x). 
4 g(x) — g(a) | 2 (> a) 
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Choose x2 such that x: > xı and g(x) > g(xı) when x > x2. It then follows 
from (7) that 


(8) | $e) oe |< § (> x), 
Choose x3 such that x3 > x2 and |@(x;)/g(x)| < €/2 when x > x3. Then 
b(x)| | dz) — der) , dle) | | $@) — dled] , | olen 
® fein w+ fo ls | ce)  |*\e@) |< 
when x > x3 Therefore, 
im [OL] tim 9 = 
(19) im [oq ~£| > n pa 7° 


and this gives the required conclusion (3) which involves “indeterminate forms 
of the form 0/0.” The following rather simple examples show how the theorem 
is applied. 


. x - Ll _ 
aD eFI ane De 
. 2 . 2x . 2 
lim —_*~—_ = _** -> lim 4 =0 
(12) am Shadi 2M jedi ~ 2 
1-3 
(13) lim a+ Vx = lim Ita” 1 
1 © x + 1 T> O 1 
- logx  , «wt a 1 _ 
as) im a Tm pei T n pe 0 (2 > 0) 
(15) lim tlogz = lim llog} = lim log xe lim Uf 0. 
t—-0+ m0 X x z— o x z— © 1 


Remark: Limits of functions of other types can be found by using the above 


formulas. For example, to find lim x7, we put y = x” and find that logy = 
z—0+ 
x log x, so lim logy =0, lim y = 1, and lim x? = 1. Similar arguments 
z—0+ z—0+ z—0+ 
show that lim «¥/* = 1. 
rr © 


17 Supposing that f, g, A are three functions satisfying the hypotheses of the 
mean-value theorem, show that the function F defined by the first of the formulas 


f) g(x) h(x) F) gl(%*) h (x*) 
fla) gla) h(a) fla) gla) ha) |=0 
f) g() hC) f(6) è) hb) 


satisfies the hypotheses of the Rolle theorem and hence that there is a number 
x* for which a < x < b and the second formula holds. Examine the case in 
which h(x) = 1 for each x. 

18 This problem provides an opportunity to learn some very interesting 
mathematics but, like a bicycle rider who lacks appreciation of basic principles 
of physics and engineering, we can pedal along withoutit. The following theorem 
is a fundamental theorem of the calculus which is stronger than Theorem 4.37 
because it does not require that f be continuous. 


F(x) = 


? 
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Theorem If f is integrable (Riemann) over a S x Sb and if F'(x) = f(x) 
when asx S b, then 


(1) MEG dx = Fe) |, = F(b) — F(a). 


Our proof of this theorem uses the mean-value theorem. Supposing that P 
is a partition of the interval a S x < b having partition points xo, x1, °° * , Xn 
for which 


(2) a= xox <e SH <i = b, 


we find that 


G) FO) — Fla) = Ù [Fle — Fen)] = yr "Gel (ar — xem) 


k=1 


when, for each &, zk, isan appropriately chosen pointin the interval xk-1 < x < xp. 
Since F' (xz) = flap), the last sum is a Riemann sum formed for the function f 
over the interval a S x S b. Since each sum is equal to F(b) — F(a), it follows 
that the limit (which exists by hypothesis) of these sums must be F(b) — F(a). 
The limit is the integral in (1) and the theorem is proved. Remark: The hypothe- 
sis that F is differentiable and F’(x) = f(x) over a £ x < b does not imply that 
f is continuous over a S x < b; in fact the last of Problems 3.69 gives examples 
of functions which are differentiable over the interval —1 < x < 1 but have 
derivatives that are unbounded over this interval. Thus, some discontinuous 
functions can be derivatives, but the following theorem shows that a function 
cannot be a derivative unless it (like continuous functions) possesses the inter- 
mediate-value property. 

Theorem If F is differentiable over a S x S b and if F'(a) < q < F'(b) or 
F'(a) > q > F'(b), then there is a number & for which a < — < b and F'(Ẹ) = q. 

To prove this theorem, let g(x) = F(x) — g(x — a). Then g, like F, must be 
continuous. Hence g(x) must attain a minimum value at some point £ for which 
asxé£&xb. Consider the case in which F(a) < q < F'(b). Since g’(a) = 
F'(a) — q < 0, we see that £ cannot be a. Since g'(b) = F'(b) — q > 0, we see 
that £ cannot be b. Hence a < £< b and therefore g'(£) = 0 and F’(&) = q. 
In case F’(a) > q > F'(b), g(x) must attain a maximum at a point & for which 
a<&< band F’(£) = gq. This proves the theorem. 

19 Prove that if f is continuous over — œ < x < œ, if f(x) ~Oasx— œ, 
and if f(x) +0 as x > — œ, then f must have a global maximum or a global 
minimum but not necessarily both. Hints: As in the proof of the Rolle theorem, 
consider three cases. In case f(x) > 0 for at least one x, choose xo such that 
f(x) > 0. Choose a number H such that |f(x)| < f(x) when |x| 2 Æ. 
The maximum of f(x) over the interval |x| S H must then be the maximum of 
f(x) over the infinite interval. 

20 Persons who manufacture peanut butter and typewriters and electronic 
organs have an abiding interest in demand curves. It is supposed that x units 
of a commodity can be sold when the price is p(x) dollars per unit. The graph 
of p versus x is the demand curve. The nature of the demand curve depends 
upon the commodity, being relatively flat (or inelastic) for false teeth, since 
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people are prone to purchase only those that are required no matter what they 
cost, and being relatively steep (or elastic) for water, since people require only 
enough to drink but like to wash everything and water their gardens when water 
is cheap. Economists and others construct and study hypothetical demand 
curves for pleasure and for business. It is usually supposd that is a positive 
decreasing differentiable function of x. When x units are sold at p(x) dollars 
per unit, the total revenue R(x) is the product of x and p(x). Thus R(x) = xp(x). 
When x units are sold, the profit P(x) is R(x) — C(x), where C(x) is the total 
cost of producing and selling the x units. Thus 


(1) P(x) = R(x) — C), 


this being one way of saying that profit is obtained by subtracting expenses from 
income. For better or for worse, economists use special terminology in studies 
of price, revenue, cost, and profit. The numbers p(x), R’(x), C’(x), and P’(x), 
these being derivatives at x, are called the marginal price, the marginal revenue, 
the marginal cost, and the marginal profit. This terminology is (or is thought to 
be) appropriate because if we are producing and selling x units and we know p(x), 
R(x), C(x), and P(x), then a shift to x + Ax, p(x + Ax), R(x + Ax), C(x + Ax), 
and P(x + Ax) is “marginal” when Ax is near zero and, for example, the number 
which [P(x + Ax) — P(x)]/Ax approximates for marginal shifts is a marginal 
profit. As is easily imagined, knowledge of functions, limits, and derivatives 
is helpful when these things are being studied. Differentiating (1) gives the 
formula 


(2) P'(x) = R(x) — C(x), 


which says that the marginal profit is equal to the marginal revenue minus the 
marginal cost. When, as frequently happens, P(x) is a maximum when P’(x) = 0 
and there is just one x for which P’(x) = 0, we obtain the following rule for mazi- 
mizing profits: choose the x for which the marginal cost is equal to the marginal 
revenue. When equations of demand curves and cost curves are given, we can 
determine the x that maximizes profits. Our course in analytic geometry and 
calculus is considered to be a prerequisite for extensive study of economics 
because it prepares us to understand definitions, work out formulas, solve prob- 
lems, and attain over-all comprehension of the subject. In fact, knowledge of 
the mean-value theorem is not superfluous. The formula 


(3) P(x +1) — P(x) = Tety i = P' 


(x*), 

in which x* is an appropriate number between x and x + 1, can help us under- 
stand the antics of elementary books that alternately use P(x + 1) — P(x) and 
the slope of the graph of P for the marginal profit. 


5.6 Sequences, series,and decimals Our mathematical foundations 
always remain quite shaky until we obtain precise information about the 
possibility of approximating and “representing” numbers by decimals. 
Moreover, we should have some solid information about this “repre- 
senting” business. We know what we mean when we say that lawyers 
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represent felons in courts of law, but nevertheless our precious corpus of 
scientific information is not appreciably augmented when a solemn tutor 
makes the unexplained statement that “‘decimals represent numbers.” 

To attack this and other matters, we must learn about some things that 
have many applications. A sequence 51, 52, 53, © * * of numbers is an 
ordered collection of numbers in which there is a first, a second, a third, 
etcetera. The individual numbers are called elements of the sequence; 
they are not called terms because terms are things that are added, and 
they are not called factors because factors are things that are multiplied. 
When 51, 52, 53, © * * is a given sequence, it may be true (or it may be 
false) that there is a number L such that s, is near L whenever 7 is large. 
This statement is meaningful. It means that when sı, 52, 53, °° > is a 
given sequence, it may be true (or it may be false) that there is a number 
L such that to each e > 0 there corresponds an integer N such that 
ls, — L| < e whenever n > N. Incase L exists, we write 

lim s, = L, 
N> 00 

as in Section 3.3, and we say that the sequence converges to L. Incase the 
limit does not exist, we say that the sequence is nonconvergent or divergent. 

As we shall see, the elementary theories of sequences and series are 
closely related. However, a series is very different from a sequence. A 
series (or simple infinite series) is an array of numbers and plus signs of the 
form 


(5.61) ui + uz + ust te. 


Because the notion of addition is involved, the numbers ui, we, u3, °° ° 
are called terms of the series. The terms are not necessarily nonnegative, 
and it is standard practice to write the series 


L+(-P+4+(-D+ht(-P+--> 
in the form 


L-$4+4-24+4-S4-°°. 


Our series u; + ue + uz + * * * contains so many terms that not even a 
high-speed electronic computer could ‘‘add them all up” during its life- 
time. In order to determine a number that can reasonably be called the 
value of the series, we need a procedure involving more than brute-force 
addition. While other procedures exist and are useful, the following is the 
most elementary and best known useful procedure. Let the sequence 
S1, 52, 53, °° * of partial sums be defined by the formulas sı = u, 
S2 = Uy + u2, 53 = U1 + uz + uz, etcetera, so that 


(5.62) fatter ti A (n = 1,2,3,°°°). 
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If it happens that this sequence of partial sums converges to s, so that 


(5.621) lim Sn = S, 

n> 0 
then we say that the series converges to s, and, leaving the significance of 
the horrendous operations to be revealed in Problem 6 and Chapter 12, 
we say that the series has the sum s and we write 


co 


(5.622) S= u, tH uz: +t uz + > or s= > um. 


k=1 


In case a given series is not convergent, we say that itis divergent. The 
series 


1-14+1-141-14--- 


is a classic example of a divergent series. 

We are now ready to attack decimals. Let dı, do, dz, © be a 
sequence each element d, of which is one of the 10 digits 0, 1, 2, 3, 4, 5, 
6, 7,8,9. The array 


(5.63) O.dided3- °°, 


in which the first dot is a decimal point, is then an infinite decimal. We 
confine our attention to decimals of this form; presence of a positive 
integer before the decimal point causes no difficulties. Just as the left 
side of the equation 


6 0 


3 1 9 
0.31690 = 75 + imt+ T0 + Io t io 


is a remarkably efficient way of abbreviating the right side, so also (5.63) 
is a remarkably efficient way of abbreviating the infinite series 


dı dz ds ce 
(5.631) 10 t I0 t ios t 


Thus the infinite decimal is an infinite series in disguise. 


Theorem 5.64 Each infinite decimal O.dıdodz - - + converges to a 
real number s. 


If we think it will serve a useful purpose, we can say that the decimal 
“represents” the number to which it converges. In any case, we write 


(5.641) s = O.d,ded3 -- > 


when the decimal converges to s. To prove the theorem, let s, denote 
the sum of the first n terms of the series (5.631) so that 


od d; da 
sa = i0 tig t e.. >? + Tox 


and 
Sn = 0.d ida . foe da. 
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The set S consisting of the numbers 5), 59, 53, - © - is then nonempty and 
has the upper bound 1, sinces, < 1 for each n. Therefore, Theorem 5.46 
implies that S has a least upper bound which we denote by s. Then 
sn S Sforeachn. Toeache > 0 there corresponds an index N such that 
sy > 5 — €, since otherwise s — e would be an upper bound of S less 


than s. But the numbers dj, do, d3, - > + are all nonnegative, and hence 
s—e<5, 85 when n > N. Therefore, lim s, = 5 or s = 0.d d-d; 
N> O 


- and Theorem 5.64 is proved. For future reference, we note that 
very minor modifications of this proof yield proofs of the following two 
theorems. 

Theorem 5.65 If the terms of the series uy + us + uz; + ©- - are 
nonnegative and if the sequence of partial sums has an upper bound, then 
the series 15 convergent. 

Theorem 5.651 If a sequence sı, 52, 53, - - - is monotone increasing 
(that is, 5m S Sn when m < n) and bounded above (that is, 52 S M for each 
n) then the sequence is convergent. Similarly, each monotone-decreasing 
sequence which 1s bounded below must be convergent. 

In connection with Theorem 5.64, it is often necessary to recognize the 
awkward fact that two different infinite decimals can converge to the 
same number. For example, 


+ = 0.250000 --- , 4+ = 0.249999 ---. 


This situation occurs, however, only when one of the decimals has only 
nines from some place onward. In Theorem 5.64 we started with a deci- 
mal and found that it converges to a number. The next theorem is 
different; we start with a number and find a decimal which converges to it. 

Theorem 5.66 If sis a number for which O < s < 1, then there is a 
decimal 0.d,ded3 > © > which converges to it. 

Our proof of this theorem involves manipulation similar to the manip- 
ulations of Problem 18 of Problems 5.49, where more details are given. 
Let dı be the greatest integer for which 0.d; Ss. Then s — 0.1 < 
0.d; S s. Let d} be the greatest integer for which O.dıd < s. Then 
s — 0.1? < O.dıdz < 5. Let d3 be the greatest integer for which 0.d,ded3 S 


s. Then s — 0.13 < 0.d,;d.d3 S s. Continuation of this procedure 


yields a decimal O.dided3 - + + that converges to s so that 
= O.djdod3 ` - - 


We conclude this section with a study of geometric series and repeating 
decimals. When x = 1, the identity 
1 — x" 


(5.67) po lteter te ta 


can be proved either by long division or by multiplying by 1 — x. When 
|x| < 1, the sequence |x|, |x|?, |x|, - - - is monotone decreasing and 
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bounded below by O and hence must have a limit. If we let L denote 
this limit, then 


L = lim |+! = |x| lim |x|” = |x|, 
so (1 — |x|)Z = 0 and hence L = 0. Therefore, as we have previously 
proved in another way, 
(5.671) lim x" = 0 (|x| < 1). 


But the right member of (5.67) is the sum of the first n terms of the series 
in the right member of the formula 


alte tattatt ee. (|x| < 1). 


(5.672) 


Hence, when |x| < 1, taking the limits as n becomes infinite of the mem- 
bers of (5.67) gives (5.672). Multiplying the members of (5.67) by a 


constant a gives the very important formula 


(5.673) = a + ax + ax? + ax? t `t (jx| < 1) 


a 
l-—x 
which must be permanently remembered. The series is a geometric series 
with ratio x, the ratio being the factor by which we multiply one term to 
get the next. The easy way to remember the formula is to remember 
that, when the absolute value of the ratio is less than 1, a geometric series 
converges to the first term divided by 1 minus the ratio. 
A repeating decimal is one, like 


3.16952 952952 +», 


in which, from some place onward, the digits involve only periodic 
repetitions of a collection containing one or more digits. With the aid of 
(5.673) we can show that each repeating decimal converges to (or is) a 
rational number, that is, a quotient of two integers. For example, if s 
is the number to which the decimal displayed above converges, then 


316 1 952 952 
s = +> (952 + 2 000 * (1000)? e) 


100 ` 100 
316 1 952 316 1 92 


~ 100 ' 100 1— -vso 100 10099 


It is presumed that we can add fractions when there is a reason for doing 
so, and we can see that s is a quotient of integers with denominator 99900. 
The most important fact concerning repeating decimals is set forth in 
the following theorem. 
Theorem 5.68 The (terminating or nonterminating) decimal expansion 
of each rational number is a repeating decimal. 
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Proof of this fact can be based on the ordinary process by which “long 
division” is used to divide one positive integer, say P, by another, say Q. 
At each sufficiently advanced stage of the process, we obtain a representa- 
tion of P/Q of the form 
(5.681) 5 = N+ 0.did2 +++ dat ivy 
where N is an integer, the d’s are digits, and Pa is an integer remainder 
for which O = Pa < Q. After the long division has progressed past the 
place where no digits other than zero are ‘‘brought down,” the remainders 
and hence also the @’s run through cycles that produce the repeating 
decimal. A cycle begins when a remainder becomes equal to a previous 
remainder, and this must happen because 0, 1, 2, -- - , Q — 1 are the 
only values that remainders can have. Dividing 365 by 7 shows an 
application of the ideas. The long division process never produces 
decimal expansions which, from some place onward, consist exclusively of 
nines, but these expansions are clearly repeating decimals. 

The elementary arithmetical consequences of Theorem 5.68 are enor- 
mous. We can easily write nonrepeating decimals, examples being 


0.1234567891011121314151617 - - - 
where the positive integers are written in order, and 


0.101001000100001000001 ---. 


These decimals converge to real numbers that are not rational and are 
called irrational (not ratio-nal). This proves existence of irrational 
numbers. Moreover, we can easily generate the idea that if the digits in 


O.d,deds3d4 oo. 8 


are selected in some random way, then it is highly unlikely (or even 
almost impossible) that the decimal would be a repeating decimal. 
This leads us to the idea that “almost all” real numbers are irrational, and 
there are different ways in which this idea can be made precise. 


Problems 5.69 
1 Show that if a, b, and c are digits, then 
(a) O.aaaa--- =5 (b) O.ababab - - - = Watt 
(c) baaaa +++ =~" D cababab - - - = Dte tt 
(¢) O.baaa +». =~ t8 (f) O.cabab -.- = Etat? 
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2 Write an infinite decimal which converges to an irrational number between 
0.43211 and 0.43212. 

3 Supposing that 0.31690416 - - + and 0.31690444 - - - converge to irra- 
tional numbers, write a rational number that lies between them. 

4 Supposing that a and b are different positive numbers, give a procedure 
by which we can find a rational number x; and an irrational number x, that lie 
between a and b. 

5 For a long time before the advent of electronic computers, the base 10 of 
the decimal system reigned supreme and most people thought that other bases 
had only theoretical interest. Nowadays the base 2, which employs the two 
binary bits O and 1 instead of the ten decimal digits 0, 1,2, © - + ,9, is very impor- 
tant. In the binary system, the left member of the formula 


(1) bmbm—1 ` © © bobibo = bm2™ + Dmi12™ 1) + + + * + 02? + 312 + do, 


in which each bit b+ is O or 1, abbreviates the right member. Thus the binary 
representations of the first few positive integers are 


(2) 1, 10, 11, 100, 101, 110, 111, 1000, 1001, + + +. 
Similarly, 

bı ba bg. b 
(3) (bibb t h F tata tate’ 


where the subscript 2 in the left member informs us that the “point” is not a 
“decimal point” but is a “binary point” and that each b is a binary bit. One 
reason for importance of binary bits lies in the fact that one “state” such as 
“light on” or “switch closed” or “true” can be represented by 1, while the oppo- 
site “‘state”’ such as “light off” or “switch open” or “‘false’’ can be represented 
by 0. Perhaps without knowing why, we can pick up useful ideas by solving 
a few simple problems. Show that 


(a) (29)10 = (11101): (b) (100)2 = (4)10 

(c) (100)10 = (1100100). (d) (416)10 = (110100000). 
(e) (10011)2 + (10110). = (101001): (f) (z) = (0.00111)2 

(e) ($)10 = (0.01010101 - - -)e 


Remark: Many persons with substantial lacks of enthusiasm for adding, sub- 
tracting, multiplying, and dividing with decimal digits can find genuine amuse- 
ment in learning to make these manipulations with binary bits. Scientists need 
never be bored because of lack of interesting things to do. 

6 Inquisitive students may ask why we write 


(1) S= Uy tuz tuz t.. 


when the series converges to s. The answer lies partly in the fact that it is much 
easier to write (1) than to write the statement that “‘s is the number to which the 
series vı + uz + ug + ° * - converges” and partly in the fact that the method 
of convergence which we have described is the simplest useful method for assigning 
values to series. There are other methods that are both venerable and useful. 
One of these is the method which is called the method of Abel (1802-1829) even 
though it was extensively used by Euler (1707-1783) and was used by Leibniz 
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(1646-1716) and others before Euler. A given series u; + ug + uz + ->is 
assigned the value V by this method if the series 
(2) uy ur + uzg? + +: 
converges when 0 < r < 1 to the values f(r) of a function for which 
(3) lim f(r) = V. 
ra l— 


Use these ideas to find the Abel value of the series 1-1+1—1+-:°:. 
Hint: The series 1 —~r+r?—r3+ >» is a geometric series whose ratio is 
—r, and the series converges to 1/(1 + r) when |r| < 1 Remark: Our mathe- 
matical notations would be more sensible but less brief if we were accustomed to 
writing 


(4) s=Clu,tusetut-:-::} 


to abbreviate the statement that the series in braces is assigned the value s by 
the method of convergence and to writing 


(5) $= A{1-141-14+--+} 


to abbreviate the statement that the series in braces is assigned the value $ by 
the method of Abel. This more elaborate notation can show just what we are 
doing when we adopt the convenient but absurd old idea that a conglomeration 
of numbers and plus signs “is” a number or “represents” a number if and only 
if it converges to the number. An intelligible theory of series requires a suitable 
mixture of broad ideas of Euler and narrow ideas usually promoted by elementary 


books of the nineteenth and twentieth centuries. 


7 Each sequence 51, 50, 53, © © © of numbers determines its sequence Mi, Mz, 
M, © + - of arithmetic means defined by the formulas 
—stseteot tts 1 E a 
(1) M, = - =- » x (a= 1,2,3 °°). 
If 
(2) lim sn = 5, 


so that s, is near s whenever n is large, we can feel that M, should also be near 
s whenever n is large and hence that 


(3) lim Mn = s. 
Prove that (2) implies (3). Solution: Let e > 0. Choose an integer N such that 
lsn — s| < €/2 whenever n > N. Then, when n > N, 


— — e.. a— le 
(4) m, -s= G5dtG— pb Fen) LY aa 
k=l 
and hence 
1< 1 < C 
(5) |Ma — s| S ny, e= sl ta ls. — s| S z 
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N 
where C = D lsz — s|. If we choose Nı such that N, > N and C/n < e/2 
k=1 
when n > N, then we will have 


(6) \M, —s| <€ 


when n > N;. This proves (3). Remark: It often happens that the limit in 
(3) exists when the limit in (2) does not exist. In case uj + uz + ug + -iş 
a series having partial sums sı, 52, © * > and arithmetic means Mı, Mo, M3, --.- 
such that (3) holds, we can write 


(7) s = Cifu t uz +u t'e} 


and say that the series is evaluable to s by the method of arithmetic means or by the 
Cesaro method of order 1. 

8 Supposing that n is a positive integer, sketch a graph of the function f, 
for which f,(x) = n?x when |x| S 1/n and f,(x) = 1/x when |x| > 1/n. Show 
that f, is continuous over E}. Show that 


lim fa(x) = g(x), 


where g(0) = 0 and g(x) = 1/x when z ¥ 0. Hint: Consider separately the 
cases in which x = 0, x > 0, and x < 0. 
9 Using the notation of the preceding problem, let 


ur(x) = fi(x) 

u(x) = f(x) — filx) 
us(x) = f(x) — folx) 
us(x) = fax) — fal), 


etcetera, so that u(x) = f,(x) — fe_i(x) when k = 2, 3, 4,° °°. Show that 
each function u, is continuous over £; and that 


ce 


>, u(x) = g(a). 


k=1 


Remark: It is sometimes necessary to be sophisticated enough to know that a 
series of continuous functions may converge to a discontinuous function. More- 
over, we should be tall enough to peer over the wall of our garden and observe 
that a series u(x) + uo(x) + ++: of functions having partial sums fi(x), 
f(x), © > + is said to converge uniformly over a set E to f(x) if to each positive 
number e there corresponds an integer N such that |fa(x) — f(x)| < e whenever 
n 2 N and x is in E. The following theorem is proved in advanced calculus. 
Ij a series u(x) + u(x) + + + - of continuous functions converges uniformly over 
E to f(x), then f must be continuous over E. 

10 Starting with positive numbers a; and b, for which a; < bı, let sequences 
4i, de, 43, °° * and by, be, bz, © + + be defined recursively by the formulas 


a) ta Virb baa = ot 
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Show that, for each n = 1, 2,3,--- 


? 


(2) An < Ani < Davi < bn. 
Tell why there must exist numbers Lı and Lz such that 
(3) lim a, = Li, lim bn = La. 


n> a n—> 0 


Show also that 


(4) O < Past — angi < EED L a, = OM 
and hence 
(5) 0 <LL < (L: — Lı), 


sol, = Lı. Remark: The common value of the two limits in (3) has an impressive 
name; it is the arithmetico-geometric mean of the two given numbers a, and b}. 

11 This problem, which is in some respects the most significant problem in 
this chapter, would be much too difficult if it were not prefaced by a rather elab- 
orate story. We make the reasonable assumption that Mr. C., a particular 
carpenter, never heard of the Dedekind axiom 5.43, and that his ideas about the 
real-number system are incomplete. Next we make the reasonable assumption 
that the class R* (read R star) of numbers that Mr. C. knows about is the class 
of rational numbers which he may call “whole numbers and fractions.” This 
class R* is, for many purposes, a thoroughly useful class of numbers. If x and 
y belong to R*, so do x + y, x — y, xy, and also x/y, provided y = 0. While 
we may be somewhat surprised by the fact, it is nevertheless true that Mr. C. 
could define graphs, limits, derivatives, indefinite integrals, Riemann integrals, 
and many other things exactly as we defined them. Mr. C. could show, exactly 
as we did, that if f(x) = x?, then f'(x) = 2x. There would be many respects in 
which his analytic geometry and calculus would be thoroughly satisfactory. 
He would say, exactly as we did, that f is continuous at xo if to each e > O there 
corresponds a 6 > O such that | f(x) — f(xo)| < e whenever |x — xo| < 6, but of 
course only rational numbers appear in his work. Mr. C. would be totally 
unaware of the existence of irrational numbers, but we could nevertheless select 
an irrational number & for which O < £ < 1 and put Mr. C. to work studying 
the function f for which 


f=) = -1 (0 Sx < &) 
0) Pa =] (E<x Zl). 


Mr. C. would discover that f is defined for each x in R* for which O < x < 1, 
and hence he would say that it is defined over the interval 0 Sx S1. He could 
prove that f is continuous at each x in R* for which O0 <x <1. He would 
therefore say that it is continuous over the interval0 S x $1. He could prove 
that f'(x) = 0 for each x in R* for which O < x < 1. So far there is nothing 
wrong, but there will be something wrong if Mr. C. tries to tell us that f'(x) = 0 
when 0 < x < 1 and hence “‘it is obvious” or “it can be shown” that there must 
be a constant k such that f(x) = k when0 < x < 1. In fact, a look at the for- 
mulas (1) defining f shows that there is no constant k such that f(x) = k when 
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0<x< 1. It is now time to seek the moral of this story. If we are not sure 
whether the set of numbers we use in our analytic geometry and calculus is the 
complete set of real numbers for which the Dedekind postulate is valid, then we 
cannot be sure about the validity of the ideas that we need to enable us to do 
our chores. It is, therefore, not enough to know the axioms usually given in 
one way or another in elementary arithmetic and algebra and “finite mathe- 
matics.” We need, in addition, the Dedekind axiom or an equivalent axiom 
which guarantees that we are using the complete class of real numbers in our 
work. We now come to the problem. Tell whether it is necessary to use the 
Dedekind axiom (or, what amounts to the same thing, to use consequences of the 
Dedekind axiom or an equivalent axiom) in order to (a) prove the Rolle theorem 
5.51, (b) prove the intermediate-value theorem 5.48, (c) define the area of a 
rectangle to be the product of its dimensions, (d) define the derivative of a given 


2 
function f, (¢) prove existence of Í, (1/x) dx. 


5.7 Darboux sums and Riemann integrals This section can be 
omitted from this course without damaging understanding of the rest of 
the book. There can, however, be no doubt that students with serious 
interest in pure mathematics should master it and that everyone else 
should read it. The section gives substantial information about a stand- 
ard way of attacking matters relating to existence of Riemann integrals. 
Let f be defined and bounded over an intervala S x S b so that, for some 
constants m and M, we have 


(5.71) m S f(x) S M (a S x Sb). 


As in our definition of Riemann sums, let P be a partition such as the one 
shown in Figure 5.711 and, for each &, let x be selected such that x,_; $ 


— 
— 


a=% x Xz Xk- Tk x 


Figure 5.711 


xf S xp. Let Any = xk — xk. For each k = 1,2,°°° , 2, let 


(5.712) m = g.l.b. f(x), M, = lu.b. f(x) 


Zk- ST SLE Zk-1 ST STk 


so that m, and M; are respectively the greatest lower bound and the least 


upper bound of f over the interval x1 S x S xk. The numbers UDS(P) 
and LDS(P) defined by 


(5.72)  LDS(P) = Ý m Am,  UDS(P) = Ý M, Am 


k=1 k=1 


are called the lower and upper Darboux (1842-1917) sums determined by 
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Figure 5.721 


P. Figure 5.721 is available for inspection. For each choice of the 
points xý we have mą S f(x) < M, and hence 


nr 
(5.722) LDS(P) < Y f(xt) Ax £ UDS(P). 
k=l 

Therefore, for a given partition P, the different Riemann sums that can be 
formed by making different choices of the points xž are all sandwiched 
between the lower and upper Darboux sums. Information about 
Riemann sums can therefore be gleaned from information about Darboux 
sums. 

The first step in our study of Darboux sums may seem to be a very 
modest one. Let P be a given partition, and let P’ be a simple extension 
of P. By this we mean that P’ is exactly the same as P except that P’ 
contains one additional partition point, say X, which lies between two of 
the partition points of P, say x2 < X < x3}. The inequality 


Ua lub. J(=) (X — x2) +E l.u.b. ANG — x) 
< [ lu.b. KE - La) Lub. f@)l(%s — X) S Ms Axs 


z2: St S23 


implies that UDS(P’) < UDS(P). Figure 5.721 is not needed in the 
proof of the inequality but may nevertheless be helpful. Consideration 
of simple extensions of simple extensions of P leads to the conclusion that 
if P’ is any extension of P (so that P’ contains all of the partition points of 
P and perhaps also some additional ones), then UDS(P’) S$ UDS(P). 
An analogous argument, in which greatest lower bounds appear and the 
inequality signs are reversed, shows that if P’ is an extension of P, then 
LDS(P’) 2 LDS(P). Suppose now that P, and P, are two given parti- 
tions, and let P; be an extension of both P, and Pa. Then 

(5.723) LDS(P:) S$ LDS(P;) £ UDS(P;) £ UDS(P2) 

and hence 


(5.724) LDS(P1) = UDS(P»). 
This is a key result of the theory. Let the symbols in 


(5.73) L= MEG dx, U = ME dx 
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denote, respectively, the least upper bound of all lower Darboux sums 
and the greatest lower bound of all upper Darboux sums. These num- 
bers are, respectively, the ower and upper Darboux integrals of f over the 
interval a $x Sb. It follows from (5.73) that, for each partition Py, 
L < UDS(P.), and it follows in turn from this that L £ U. Thus 


(5.731) LDS) s L s U s UDS() 


for each partition P. 

There are bounded functions f for which L < U. For example, let 
a = 0, let b = 1, and let f(x) = 0 when x is irrational and f(x) = 1 when 
x is rational. The LDS(P) = 0 and UDS(P) = 1 for each P and there- 
fore L = O and U = 1. 

It can be proved that LDS(P) is near L and UDS(P) is near U whenever 
|P| (the norm of P) is small. This result, which is sometimes called the 
Darboux theorem, means that to each e > 0 there corresponds a ô > 0 
such that 


(5.74) ILDS(P) — L| < «, [UDS(P) — U| <e 


whenever |P| <6. This and (5.731) imply that, when |P| < ô, the 
numbers LDS(P) and UDS(P) are respectively located in the left and 
right intervals of Figure 5.741 when L < U and of Figure 5.742 when 


p DS(P) ¢ UDS(P) p LDS(P) f UDS(P) 
L-e L U U+e I-€ I I+e 
Figure 5.741 Figure 5.742 


L = U =I. Consider first the case in which L < U. Since each 
Darboux sum can be approximated as closely as we please by a Riemann 
sum having the same partition points, it follows that there exist Riemann 
sums with norm |P| < 6 which differ from L by less than e and that there 
also exist Riemann sums with norm |P| < 5 which differ from U by less 
than e. It follows that if L < U, then f cannot be Riemann integrable 
overa Sx Sb. 
Consider next the case in which L = U = I. In this case 


(5.743) I — e < LDS(P) < RS(P) Ss UDS(P) < I +e 


whenever RS(P) is a Riemann sum formed for a partition P for which 
IP| < 8. Therefore, 


(5.744) [d= I, 


the integral being a Riemann integral. 
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All this gives the following theorem which involves the numbers L and 
U defined in the sentence containing (5.73). 

Theorem 5.75 If f 1s defined and bounded over a S x S b, then fis 
Riemann integrable over a Sx Sb if and only if L= U. Moreover, 
(5.744) holds when L = U = I. 

This theorem and (5.731) imply the following useful theorem. 

Theorem 5.751 A function f is Riemann integrable over a S x < b if 
and only if to each e > Q there corresponds a partition P such that 


(5.752) UDS(P) — LDS(P) < e 


The above story provides ideas and results that are used in proofs of 
the fundamental theorem (Theorem 4.26) on existence of Riemann 
integrals. We shall use Theorem 5.751 to prove some less pretentious 
theorems. 

Theorem 5.76 If f 1s defined and monotone increasing (or monotone 


. . . 6 . 
decreasing) over a S x S b, then the Riemann integral Í. f(x) dx exists. 


Let e > 0. Suppose first that f is monotone increasing so that f(x’) < 
f(x") when a S x’ <x” <b. Let P bea partition of a S$ x S$ b with 
partition points x0, %1, °° * , Xa as in Figure 5.711. Then 


(5.761) UDS(P) — LDS(P) = ş [ lub. f(x) — glb. f(x)] Ax 


km] Tk-1 SX Ste tk-1 ST STk 


n 


¥ e) — femal dn < D (es) = fla 


k=1 k=] 


[SE — FPI < e 
provided |P| is sufficiently small. This and Theorem 5.751 establish 


the result for the case in which f is monotone increasing. In case f 
is monotone decreasing, the proof is exactly the same except that [f(x+) — 
f(xz-1)] is replaced by [f(xr-1) — f(xz)] and [f(b) — f(a)] is replaced by 
[fa — f). 

It is easy to extend Theorem 5.76 to obtain a better theorem. A func- 
tion f is said to be bounded and piecewise monotone over the closed 
interval a S x S b if there is a constant M for which |f(x)| < M when 
a £x Sb and if there is a partition P of the interval a S x < b such 
that, whenever x,_; and x, are two consecutive partition points, f is 
monotone (maybe monotone increasing, maybe monotone decreasing) 
over the open interval x,-1 < x% < xx. 

Theorem 5.762 If f is bounded and piecewise monotone overa S x S b, 


then the Riemann integral [ ° f(x) dx exists.T 


f It is sometimes said that this theorem is a poor-man’s version of a stronger theorem 
which says that f is integrable over a S x < b if f has bounded variation over a S x £ b. 
Problem 10 at the end of this section provides opportunities to rise above poverty. 
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Similarly, we apply the fundamental Theorem 5.731 to prove Riemann 
integrability of continuous functions and piecewise continuous functions. 

Theorem 5.77 If f is continuous over a S x S b, then the Riemann 
integral Í > f(x) dx exists. 

To prove this, let e > 0. Theorem 5.58 then enables us to choose a 
positive number 6 such that |f(x2) — f(xı)| < e/(b — a) whenever 
a Š xı Lb, a Sx S 5), and |x. — xı| < 6. Let P be a partition of the 
interval a S x Sb for which |P| < ô. Then, with the notation of 
(5.712) and (5.72), we have My — m, S ¢/(b — a) and hence 


n 
(5.771) UDS(P) — LDS(P) = } p= Am = e. 
k=1 
The required conclusion then follows from Theorem 5.751. 

A function f is said to be piecewise continuous over the closed interval 
asx S bif itis defined over a S x < b and if there is a partition P of 
the interval a S x S b such that, whenever x,_; and x, are two con- 
secutive partition points, f is continuous over the open interval x,_1 < 
x <x, and, in addition, the unilateral limits 

lim f(x), lim f(x) 
> okt D> TkT 
both exist. On account of the fact that functions that are piecewise 
continuous over a closed interval must be bounded, it is not difficult to 
use Theorem 5.77 to prove the following more general theorem. 

Theorem 5.772 If f is piecewise continuous over a S x <S b, then the 
Riemann integral extsts. 

Finally, we use ideas and notation of this section to prove the following 
theorem. 

Theorem 5.78 If f is Riemann integrable over the interval a S x Sb 
and f(x) 20 when a Sx Sb, then the set S of points (x,y) for which 

a Lx Sb, 0 Sy S f(x) possesses an area 


M g [S| and |S| = EO, dx. 
me The proof depends upon the funda- 
s mental definition of area given in Defini- 
tion 4.44. Choose a constant M such 
° Yk- Th b * that f(x) S M — 1 and observe that S is 
Figure 5.781 a subset of the large rectangle R of Figure 


5.781. Let e> OandletO <e <e Let 
the number |S| be defined by the formula |S| = Í, ° f(x) dx. To prove 


the theorem, we shall show that |S| is in fact the area of S. Let P bea 
partition for which 


(5.782) p ff) Axe S |S| + TA È fxf) dm Z S e 
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whenever *s-1 S xý S ær for each k. Defining M; and m, by (5.712), we 
conclude that 


nT nr 
(5.783) » Mi Axr S|S| +é¢, È m An = |S| e. 
Let R, and R, be, for each k, the rectangles (meaning rectangular regions) 
consisting of points (x,y) for which xı Sx Sm, O S y S M, and 
xr- LX < Xk, M Sy SM. The two formulas (5.783) then give 


(5.784) pA IRil < |S| +, » IRi| < |R| — |S| te. 

If P(x,y) lies in S, then there is at least one k for which x-1 < x < x, and 
hence 0 < y S f(x) S Mr, so P is a point of at least one rectangle R}. 
Similarly, if P(x,y) is a point of the set S’ consisting of the points in R but 
not in S, then there is at least one & for which x,_; S x < x, and hence 
m, = f(x) S M, so P is a point of at least one rectangle R}. It is there- 
fore a consequence of Definition 4.4 that the set S does possess an area 
and that its area is |S|. This completes the proof of Theorem 5.78. 
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1 Sketch a dozen graphs that look like graphs of functions f that are bounded 
and piecewise monotone over the interval0 S$ x < 1. Besure to include graphs 
of some discontinuous functions and of some nonmonotone functions. 

2 Sketch a figure which is like Figure 5.721 except that the partition P con- 
tains 10 or 20 partition points that are roughly equally spaced. Then look at 
your figure and see how LDS(P) and UDS(P) are related. 

3 Sketch a figure which shows the geometric meanings in the statement and 
proof of Theorem 5.76. 

4 As was remarked, Archimedes (287-212 3B.c.) knew about some special 
Riemann sums, and this matter may be worthy of brief consideration here. 
When f is defined over rational values of x in the interval 0 < x < 1, we can 
make a partition of the interval 0 S x £ 1 into n equal subintervals of length 
1/n by partition points x; for which x, = k/n and form the special Riemann sum 


< k\ 1 
da= YG) 


which we can call an Archimedes sum. Without implying that Archimedes used 
modern terminology involving sums, limits, and integrals, we can recognize that 
there is historical evidence that we are merely putting ideas of Archimedes into 
modern terminology when we say that f is Archimedes integrable over the interval 
0 $x < land that f has the Archimedes integral Jif 4, >I as n— œ. Now 
comes the problem. Supposing that f(x) = 0 when x is irrational and f(x) = 1 
when x is rational, show that if the symbol 


fy fe a 
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represents an Archimedes integral, then the integral exists and has the value 1, 
but that if the symbol represents a Riemann integral, then the integral does not 
exist. 

5 For each n = 3, 4, 5, -+ - the broken line joining in order the points 
(0,0) (1/2,n), (2/n,0), (1,0) is the graph of a function fa» defined over the interval 
O<xs1. Prove that 


(1) im fy fale) de =1, fy [lim fala) de = 0. 


Hint: Observe that fa(0) = 0 for each n 2 3 and hence lim f,(0) =0. IfO0< 
n= © 
x <1, then f,(x) = 0 when 2/n < x, and hence when n > 2/x, so again lim f,(x) 
n— 0 


= 0. Remark: Persons who push very far into the theory of Fourier series learn 
that if 


_ 2 fsin nx\? _ o. 
(2) Fax) = = zr) (n=1,23, --°) 
then 
x/2 T/2 
(3) lim Í, Falz) de=1, fO Uim Fa(e)] dx = 0. 


While consideration of the matter can be postponed, our course in analytic 
geometry and calculus should be leading us toward abilities to sketch a graph 
of F, and to appreciate the fact that the two formulas in (3) can be valid. 

6 This problem invites investment of time in a speculative venture. It was 
proved in Section 4.3 that the formula 


(1) fi F'G) ds = Fe |. =F® — Fla) 


is valid whenever F has a continuous derivative over the interval a Sx < b. 
It was proved in Problem 18 of Problems 5.59 that (1) is valid whenever F’ 
exists and is Riemann integrable over the interval a £ x < b. Even though 
nobody requires us to learn everything, we may sometime be benefited by knowl- 
edge that there is a function F for which (i) F'(x) exists when —1 < x S 1 and 
(i1) F’ is not continuous but is Riemann integrable over —1 <x <1. Let F 
be defined by the formulas F(0) = 0 and 


(2) F(x) = —x? cos l + Ir 2t cos : dt (x = 0). 


That F’(0) = 0 can be proved by using the inequality 


3) Fa) = FO) 


l 


< | x cos - 
x 


+ 


L f” [21 a < 2a] 
x J0 
and the sandwich theorem. When x æ 0, differentiating (2) gives 


(4) F(x) = sin + (x = 0). 
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Thus F’(x) exists when —1 Sx £ 1. Asx approaches 0, F’(x) oscillates between 
—1 and 1 and does not have a limit, so F’ is not continuous at the place where 
x =0. However, F’ is Riemann integrable over the interval —1 <x <1 
because F(x) exists and is bounded over the interval and is continuous except 
at one place. Thus our function F has the required properties. We could have 
used the formula 


(5) F(x) = f 7 in : dt 


instead of (2) to define F. It would then have been slightly easier to obtain (4) 
but would not have been so easy to show that F’(0) = 0. There is a reason why 
no simpler example can be given. Derivatives must have the intermediate- 
value property, and no discontinuous function having the intermediate-value 
property is simpler than the function ¢ for which (0) = 0 and ¢(x) = sin (1/x) 
when x ¥ 0. 

7 If the unique individual that some textbooks like to call “the student” 
is unable to prove that each polynomial is bounded and piecewise monotone 
over each interval a S x <S b, there are only three possible places to place the 
blame. Is it the student? Is it the textbook? Is it the problem? 

8 We have, at one time and another, seen examples of faulty applications of 
the noble but frequently invalid premise that a thing T must be an element of a 
set S if T is the limit of a sequence of elements of S. One old example involves 
the “idea” that a circle must be a polygon because it is the limit of polygons. 
Another old example involves the “idea” that a Riemann integral must be the 
sum of infinitely many things because it is the limit of sums. Should we swallow 
the “idea” that an irrational number must be a rational number because it is the 
limit of rational numbers? Ans.: No. 

9 We have the possibility of extending our intellectual horizons by investing 
a few minutes or a few years in study of algebras which differ from the algebra 
of real numbers. The algebra of rational functions invites us to consolidate old 
ideas and capture new ones. When ap, 41, © * ©, Am and bo, bi, © + + , ba are 
constants for which the b’s are not all zero, the two polynomials P and Q for which 


P(x) = dg + aye tires ft anx™ Q(x) = bo t bix +--+ + bnr" 


determine the rational function f for which f(x) = P(«)/Q(«) for those values 
of x for which Q(x) #0. The sum f + g of two rational functions is the rational 
function hk for which h(x) = f(x) + g(x) for each x for which the sum is defined. 
If cis a constant and f is a rational function, then cf is the rational function having 
values cf(x). If f and g are rational functions, then fg is the rational function 
having values f(«)g(x) and [unless g(x) = 0 for each x] f/g is the rational function 
having values f(x)/g(x) when g(x) #0. Textbooks in modern algebra call atten- 
tion to many respects in which the algebra of rational functions is like the algebra 
of real numbers. Terminologies involving rings, fields, and groups facilitate dis- 
cussions of these matters. Nontrivial interest in the algebra of rational functions 
starts to develop when order relations are introduced in a particular special way. 
We say that f < g and g > f if there is a number xo such that f(x) < g(x) and 
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g(x) > f(x) for each x for which x > xo. Progress with the theory depends upon 
the basic fact that if f and g are rational functions, then f(x) = g(x) for each x 
or there is a number xo such that f(x) < g(x) when x > xo or there is a number 
xo such that f(x) > g(x) when x > xo. This basic fact depends upon the fact 
that if 4 is a rational function, then either h(x) = 0 for each x or there is a number 
xo such that A(x) is continuous and positive or continuous and negative when 
x > xo It follows that if f and g are rational functions, then one and only one 
of the three relations f < g, f = g, f > gis valid. Thus the set of rational func- 
tions is, like the set of real numbers, now an ordered field. Let fo, the zero function, 
be the rational function for which fo(x) = 0 for each x. Our algebra of rational 
functions is said to have the Archimedes property (or to be Archimedian) if to 
each pair of functions f and g for which f > fo and g > fo there corresponds an 
integer n for which nf > g. This is of interest to us because we have proved, 
with the aid of the Dedekind axiom, that the algebra of real numbers is Archi- 
median. It could be presumed that the algebra of rational functions is so much 
like the algebra of real numbers that the algebra of rational functions must be 
Archimedian. However, the presumption is false, the algebra of rational func- 
tions is not Archimedian. To prove this, let f and g be the rational functions 
for which f(x) = x and g(x) = x% Careful applications of our definitions then 
imply that f > fo, g > fo, and nf < g for each integer n. Thus our algebra of 
rational functions is not Archimedian. When all matters which we have dis- 
cussed are thoroughly understood, it becomes clear that the Archimedian prop- 
erty of the algebra of real numbers is not a consequence of those properties of real 
numbers that are ordinarily stated in elementary arithmetic and algebra. Alge- 
bra books that give adequate treatments of matters relating to order relations, 
bounds, limits, Dedekind partitions, and Archimedes properties are said to be 
modern. We have seen some of the reasons why knowledge of modern algebra 
is considered to be an essential part of a mathematical education. 

10 While consideration of the matter is usually reserved for more advanced 
courses, we have enough equipment to understand, and perhaps even prove, 
basic facts involving functions of bounded variation. Let f(t) be defined over 
asxtsbandleta<xsb. Supposing f such that T(x) exists (is finite) we 
define numbers T(x), P(x), and N(x) by the formulas 


(1) Lud. 2) 1/6) - fel = Te) 
1 n 

(2) Lub. 5 x UG — fad) + 1G) — fe} = PO) 
1 n 

(3) Lub. 3 Ò E — fedl — E — a = NG). 


In each case, the least upper bound is the least upper bound of sums obtained for 
partitions P of the interval a < t S x. The function f is said to have bounded 
variation (the term finite variation would be better) over the interval a £ t <S b. 
Let T(a) = P(a) = N(a) = 0. The numbers T(x), P(x), and N(x) are, respec- 


tively, the total variation, the positive variation, and the negative variation of f over 
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the interval a St £ x. Prove that T(x), P(x), N(x) are all nonnegative and 
monotone increasing over a Sx < b. Prove that 


(4) T(x) = 2P(x) — [f(«) — f(a)] 

(5) T(x) = 2N(x) + [f@) — fa) 

(6) T(x) = P(x) + N(x) 

(7) f(x) = f(a) + P(x) — N(x) 

(8) fæ) = [x + f(a) + P(x)] — [x + NG@)] 


whena = x S b. Useourinformation to prove that if f(x) has bounded variation 
over a S x <S b [which means that T(b) is finite], then f(x) is the difference of two 
increasing functions. Prove that if f(x) is the difference of two increasing func- 
tions over a £ x S b, then f(x) has bounded variation over a S x < b. Hint: 
To obtain (4), note that (2) contains a telescopic sum and put (2) in the form 


i — 
9) Lub. {3 D eo = sanl +P} = Pe. 


k=1 


Remark: Our results and Theorem 5.76 imply that the Riemann integral f > f(x) dx 
a 


exists if f has bounded variation overa £ x Sb. Moreover, we now have enough 
information to appreciate the most important theorem in the theory of Riemann- 
Stieltjes integrals; see Problem 11 of Problems 4.89. The theorem says that 


(10) EGEO 


exists if f is continuous and g has bounded variation over a Sx £ b. Methods 
of this section provide proof for the case in which f is continuous and g is increas- 
ing, and the general result is then obtained by expressing g as the difference of 
increasing functions. A much more difficult theorem says that if g is such that 
(10) exists whenever f is continuous over a S x S b, then g must have bounded 
variation over a Sx Sb. 


6 Cones 


and conics 


6.1 Parabolas Before plunging into the general aspects of this 
chapter, we obtain more information about the parabolas that were 
introduced in Section 1.4. Being realistic, we face some facts. We 
remember that, for some strange reason, the graph of y = kx? is, when 
k > 0, a parabola, but details involving the focus and directrix of this 
parabola may have been quite thoroughly forgotten. We try to recall, 
and henceforth remember, that the parabola has a focus F and a directrix 
as in Figure 6.11 and that the parabola is the set of points P(x,y) for 
which |FP| = |DP|. We have forgotten how the coordinates of F and the 
equation of the directrix are related to k, and we may forget again, so we 
should know how to discover the facts. To put a little variety into our 
lives, we use the symbol “?” to represent the unknown distance from the 
origin to F and from the origin to the directrix. Now we make the key 
observation. The points on the horizontal line through F all lie at dis- 
tance (2?) from the directrix. Hence the point (2?,?) which lies (2?) 


units to the right of F must lie on the parabola. The coordinates of this 
354 
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point must therefore satisfy the equation of the parabola,so? = k(2?)? and 
? = 1/4k. ‘The coordinates of F are therefore (0, 1/42), and the equation 
of the directrix is y = —1/4k. The square of Figure 6 11 having a vertex 


Directrix __” D 


Figure 6.11 


at F and two vertices on the directrix is called a focal square of the parabola. 
Another focal square lies to the left of the one in the figure. A figure 
which shows a parabola together with its focus and directrix is imperfect 
unless the parabola contains a vertex of each focal square. 

The y axis, being an axis of symmetry and the only one, is called the 
axis of the parabola. The point in which the parabola intersects its axis 
is called the vertex of the parabola. More definitions will appear in the 
problems. While parabolas have important applications in which foci 
(plural of focus) and directrices (plural of directrix) never appear, most of 
the problems involve situations in which they do appear. 

Since preliminary ideas can be very valuable, we look briefly at Figure 
6.12. The figure gives six views of the intersection of a cone and a plane 


Figure 6.12 


The cone is a right circular conical surface a part of which resembles a 
conical paper cup or ice-cream cone. The vertex V and the axis of the 
cone are in the plane of the paper. The intersecting plane is parallel to 
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a line on the cone. The intersection is a curve of which a part (the solid 
part) lies on the front half of the cone and a part (the dotted part) lies on 
the back half of the cone. Our present requirement is exceedingly 
modest. All we are required to do is grasp the idea that the curve looks 
like a parabola. Our solid information about this matter will come in the 
next section. Meanwhile, persons with artistic flairs can find useful 
entertainment in sketching sections of cones made by planes not parallel 
to lines on the cones. 


Problems 6.19 


1 Sketch a graph showing the parabola whose equation is y = x? together 
with the focus and directrix of the parabola. Draw the focal squares and make 
any repairs that may be necessary to make the parabola contain corners of the 
focal squares. Prove that the tangents to the parabola at the latter corners 
are diagonals of the focal squares, and make any additional repairs that may be 
necessary. 

2 Problems of this section deal quite exclusively with parabolas placed 
upon coordinate systems in such a way that their equations have the standard 
form y = kx?, where k is a positive constant. We can, however, pause briefly 
to note that the equation 


(1) y — yo = k(x — xo)? 


is the equation of a parabola having its vertex at the point (%o,yo). Supposing 
that a, b, c are constants for which a ¥ 0, show that the graph of the equation 


(2) y = ax? +bx+c 
is a parabola and find the coordinates of its vertex. Solution: From (2) we obtain 
(3) y a(x+2x \+e 
a(t wei 
4ac — -# 
a(« 4 =) 4 ae 7 ac 


l 


and hence 


(4) y+ = a(x+Z): 


Thus the graph of (2) is a parabola having its vertex at the point (- 2, 


2a 
— “te. Remark: In case a > 0, the equation (4) has the form (1), where 


k > 0 and the graph “opens upward” like the graph of y = kx®. Incase a < 0, 
(4) has the form 


(5) yY — Yo = —k(x — xo)?, 
where k > 0 and the graph “opens downward” like the graph of y = —kx?. 
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These matters are important because equations of the form (2) appear very often, 
but for basic studies of parabolas we use the standard form y = kx*, where k > 0. 

3 Show that the tangent to the graph of the equation y = kx? at the point 
P (1,22) has the equation 


y — kx? = 2kx(x — x1) or y = kx,(2x — x). 
Show that this tangent intersects the y axis, the x axis, and the directrix at the 
points 
A(0, —kx?), B (2 0), c(2 Z, +) 
= 4k 


provided, for the last point, x; #0. Sketch a figure in which the parabola, the 
tangent, and the points 4, B, C all appear. 

4 Show that the normal to the graph of the equation y = kx? at the point 
P(x1,kx3) has, when xı ¥ 0, the equation 


y— kx? = — (x — xi). 


2kxi 
Show that this normal intersects the y axis, the x axis, and the directrix at the 
points 


A* (0, 4 + kx’), B¥ (x, + 2h2x3, 0), c* i + 2kxi, — +). 
2k 4k 
Sketch a figure showing all of these things. 

5 Two points (%1,y1) and (x2,ye) lie on the parabola having the equation 
y = kx®. Prove that the coordinates of the intersection R of the y axis and the 
line through these points can be put in the form 
(0,—x\x2). Figure 6.191 illustrates results of 
this and the next two problems, but the figures 
look quite different when x; and x2 have opposite 
signs. 

6 Two points (%1,y1) and (%2,y2) lie on the 
parabola having the equation y = kx?. Prove 
that the coordinates of the intersection of the 
tangents to the parabola at these points can be 
put in the form 


(2 + x2 ) 
a ae kxixo . 


7 Show that the results of the two preceding 
problems yield the following theorem. Let P, and Pz be two points on a parabola. 
Let Q be the intersection of the tangents to the parabola at Pı and Pe. Let R be 
the intersection of the line P,P. and the axis of the parabola. Then the mid- 
point of the segment QR lies on the line tangent to the parabola at the vertex. 
Solution: The results of the preceding problems show that the mid-point is 


Figure 6.191 


x1 + x ow, 
(=, 0), and this point is on the tangent to the parabola at the vertex. 
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8 Let P, be a point on a parabola which is not the vertex V. Let W be the 
intersection of the tangents to the parabola at Pı and VY. Show that the line 
from the focus F to W is perpendicular 
to the line WP,. Hint: Let the parabola 
have the equation y = kx? and use the fact 
that F has coordinates (0, 1/4k). See 
Figure 6.192. 

9 Let Pı be a point on a parabola which 
is not the vertex VY. Prove that the tangent 
to the parabola at P; meets the directrix and 
the line through the focus F parallel to the 
directrix at two points Q and R that are 
equidistant from F. See Figure 6.192. 

10 A particle P moves on the parabola 
having the equation y = kx? in such a way 
that, at each time #, its x and y coor- 
dinates are ¢ and &t? and the vector r running from the origin to P is 


r = fit kej. 


Figure 6.192 


Show that the velocity vector V is 
v = i + 2kij, 


and note that this vector is also a “forward tangent” to the parabola at P. 
Letting F be the focus of the parabola, show that 


= ż 2 ; d = £72 + —- 
FP =ti+ (i: =) an |FP| z 


Letting ġı be the angle between the vector FP and the tangent vector v, show 
that 


_ FPy Ok 
IFPI] V1 + 422 


Letting ¢2 be the angle between the tangent vector v and the vertical vector j, 
show that 


cos ¢; 


2kt 
s/1 + 4k: 


and hence that de = ġı. Remark: These formulas yield the famous reflection 
property of parabolas. They imply that the line FP and the line extending 
upward from P make equal angles with the normal to the parabola at P. This 
implies that if light or something else goes in a line from F and is reflected from 
the parabola in such a way that the angle 6, of reflection is equal to the angle 
6, of incidence, then its path after reflection is parallel to the axis of the parabola. 

11 Modify the work of the preceding problem to make a direct attack upon 
the angles which FP andj make with the normal to the parabola at P. Remark: 


This should be done when our primary interest lies in angles of incidence and 
reflection. 


-Ji 
cos ee = i 
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12 This problem and Figure 6.193 
delve a bit deeper into the geometry of 
parabolas. The figure shows the pa- 
rabola having the equation y = kx?, 
where k >0O. The focus F and the 
directrix have the coordinates (0, 1/42) 
and the equation y = —1/4k. Let 
x, > 0. Show that the line through 
P3(m, kx?) parallel to the axis of the pa- 
rabola intersects the directrix at the 
point Dı(xı, —1/4k). Show that the 
tangent to the parabola at Pı intersects 
the axis of the parabola at the point 
0,(0,—kx;). Show that the quadrilat- 
eral Q,D,PiF is a rhombus, that is, an 
equilateral parallelogram. This rhom- 
bus could be called a focal rhombus; in Figure 6.193 
any case it is a focal square when 
x, = 1/2k and the rhombus isa square. Show that the diagonals of this rhombus 
are perpendicular to each other and that they intersect at the point (x,/2, 0). 
Finally, show how these results and elementary geometry can be used to prove 
the reflection property of the parabola, namely, that the line from the focus to 
Pı and the line through P, parallel to the axis of the parabola make equal angles 
with the tangent to the parabola at P,. 

13 Write the equation of the tangent to the graph of the equation y = x? 
at the point (xı,x?) and then try to determine x; so the tangent will contain (or 
pass through) the point 


(a) (1,1) (4) (1,0) (e) (0,1) (d) (—1,-1). 


14 Find the equation of the normal to the graph of y = x? at the point (x1,x) 
on the graph. Show that if yo <S $, then there is only one value of xı for which 
the normal passes through the point (0,yo), but that if yo > $, then there are 
three values of xı for which the normal passes through the point (0,9). Sketch 
a figure or figures which show that the results seem to be reasonable. 

15 As is the case for circles, a line segment joining two points on a parabola 
is a chord of the parabola, and the set of mid-points of the chords parallel to a 
given chord is called a diameter of the pa- 
rabola. Thus a diameter is a point set, not Figure 6.194 
a number. Letting the parabola have the 
equation y = kx®, where k > 0, prove that 
for each m the diameter determined by 
chords having slope m is the line segment 
containing points (x,y) for which x = m/2k 
and y = m?/4k. 

16 A focal chord of a parabola is a line 
segment which contains the focus and hasits 
ends at points on the parabola. Supposing 
that x2 < 0 < x; as in Figure 6.194, show 


P(x, hx) 
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that the two points Py (x1,Rx%), Palza kaa) on the graph of y = kx? are end points 
of a focal chord if and only if (2kx1)(2kx2) = —1 and hence if and only if the 
tangents to the parabola at Pı and P2 are perpendicular. 

17 With or without the aid of the results of the preceding problems, show 
that two different tangents to a parabola intersect on the directrix if and only if 
the tangents are perpendicular and hence if and only if the points of tangency 
are ends of a focal chord. 

18 Prove that the center of a focal chord of a parabola is equidistant from the 
directrix and the ends of the chord. 

19 Two equilateral triangles in £2 are similar in the sense that one can be 
transformed into the other by a translation, a rotation, and a change of scale 
Show that the same is true of two parabolas in Ee Haint: Suppose that the two 
given parabolas are translated and rotated so that their equations become 
y = kx? and y = kax?, where kı and k are positive constants. Show that if in 
the first equation we change scale by replacing x and y by Ax and Ay, we obtain 
y = (Aki)x?. 

20 Let k be a positive constant. For each positive number a, let F(a) be 
the y coordinate of the center of the circle tangent to the graph of y = kx? at 
the points for which x =a and x = —a. Find F(a) and lim F(a). Ans.: 

a— 


ka? + 1/2k and 1/22. 

21 Let k be a positive constant. For each positive number a, let (G(a), 
H(a)) be the center of the circle which is tangent to the graph of y = kx? at the 
point (a,ka?) and which contains (or passes through) the origin. Show that 


G(a) = ; ka? + 5 H(a) = — ka? 


a—~0 


and lim G(a) = 1/2k, lim H (a) = 0. 
a—0 


22 Sketch a graph of the parabola having the equation y = x? and then 
sketch several circles which have centers on the positive y axis and are tangent to 
the x axis at the origin. Observe that sufficiently big circles in this family inter- 
sect the parabola at points different from the origin and that small circles leave 
us in doubt. Supposing that k > 0, investigate this matter for the parabola 
having the equation y = kx?. Ans.: The circle with center at (0,2) and radius 
a intersects the parabola only at the origin (and is elsewhere above or “‘inside” 
the parabola) if and only if a < 1/2k. Thus the biggest one of these circles 
which lies completely on or inside the parabola has radius equal to the distance 
from the focus to the directrix of the parabola. 

23 Study the set S which contains a point P(x,y) if and only if the point is 
Figure 6.195 equidistant from the x axis and the circle with center at 

` the origin and radiusa. Solution: This problem is interest- 

ing because S contains some points inside the circle as 

well as some points on and some points outside the circle; 

see Figure 6.195. Whether a point P(x,y) lies inside or 

x on or outside the circle, it will be in the set S iff (if and 
only if) 


(1) [væ + y? — a] = jy] 


Y 
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and hence if 


(2) Vetyt=aty 
If (2) holds, then 
(3) x? + y? = a? + day + y? 
and hence either 

_~ 4, x 
($) y 2 + 2a 
or 

-2% 


It can be shown that if (4) or (5) holds, then (1) holds. It follows that S is the 
sum (or union) of two parabolas of which one has the equation (4) and the other 
has the equation (5). Each parabola has its focus at the origin, and the directrices 
are the tangents to the circle that are parallel to the x axis. The parabolas 
intersect the x axis where the circle does. 

24 With or without the aid of results of preceding problems, let P be a given 
parabola and verify the following facts which show students of mechanical draw- 
ing how to locate the axis, vertex, focus, and directrix of P. The mid-points Mı 
and M» of two parallel chords C, and C, of P determine a line L, parallel to the 
axisof P. The axis L of P is the perpendicular bisector of the line segment joining 
points where a line perpendicular to Lı intersects P. In case L; is not the axis of 
P, the mid-points M; and M, of chords Cj and C, perpendicular to Cı and Co 
determine another line Lj parallel to L. Let Lı and Lj intersect the parabola at 
P, and Pi. Then the line T, (or T4) through P, (or P}) parallel to Cı (or C}) is 
tangent to P at P, (or Pi). Moreover T, and 7; are perpendicular, and their 
intersection is on the directrix of the parabola. Finally, the line segment joining 
P, and Pi is a focal chord of the parabola so this segment intersects L at the focus. 
Remark: Anyone who spends a substantial part of his lifetime working with 
parabolas can learn very much about them. 


6.2 Geometry of cones and conics Throughout this chapter, a cone 
is always a complete right circular conical surface consisting of two parts, 
or nappes, asin Figure6.21. Weassume that < a < 7/2 
and that the lines on the cone all make the same angle a 
with the axis of the cone. For the present, we simplify 
our discussion by supposing that the axis of the cone is 
vertical, and hence that each plane perpendicular to the 
axis is horizontal. A conic (or conic section) is the set of 
points in which a plane v intersects the cone. Incase 
contains the vertex V, the resulting conic is either a single 
point or a single line or a pair of intersecting lines. In 
case m is perpendicular to the axis of the cone and does 
not contain the vertex, the conic isa circle. Our interest 


Figure 6.21 
Axis 
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in this chapter lies in conics of less simple natures. ‘These turn out to be 
parabolas and ellipses, each of which intersects only one nappe of the 
cone, and hyperbolas, each of which intersects both nappes of the cone. 

Without yet knowing what will be 
learned, we look at Figure 6.22, which 
is one of the most remarkable figures 
of elementary geometry. This is a 
flat nonperspective figure which must 
be constructed and studied rather 
carefully before it can be fully under- 
stood. The vertical line in the plane 
of the paper is the axis of a cone with 
vertex VY and vertex angle œ. Theline 
making the acute angle @ with the axis 
represents more than a line. It is 
supposed to lie in the plane of the 
paper, and it represents a plane r 
which makes the angle 8 with the axis 
of the cone and which intersects the 
coneinaconic K. Wecan, if we wish 
to do so, think of the plane z as being 
Figure 6.22 an xy plane in which the x axis is 

` pointed toward our eyes, and every- 
thing in this plane seems to lie on one line. The graph of each point P on 
the conic K, whether P lies on the part of the cone in front of the plane of 
the paper or on the part behind the paper, is on the line. Our first 
significant step is to fit a sphere into the cone, the sphere being just big 
enough to be tangent to the plane r. The circle in the figure represents 
this sphere, which is tangent to the cone at the points of a circle which 
lies in the horizontal plane mı and which is tangent to the plane r at 
the point F. 

It can now be revealed that discoveries will be made; in fact we shall 
show that F is a focus of the conic K. Because m, is horizontal and = is 
not, these planes intersect in a line L which is represented by a single 
point in the flat figure. We shall show that L is a directrix of the conic K. 
To start learning something about the conic K, let P be a point on K 
and draw the line segment PD which lies in x and is perpendicular to L 
at the point Don L. The line PV lies on the cone and is tangent to the 
sphere ata point 4inw,. The line PF lies in x and is tangent to the sphere 
at F. Therefore |PF| = |PA| because the two vectors have their tails 
at the same point P and are tangent to the sphere at their tips. If we 
let d be the distance from P to the plane m, then d = (4P| cos a because 
the vector AP makes the angle a with vertical lines. Alsod = [DP] cos ĝ 
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because the vector DP makes the angle @ with vertical lines. Equating 
the two expressions for d gives the formula 


[ZP] cos a = IDP] cos 8. 
This and the fact that \PF | = IPA | give the fundamental formula 


(6.23) PP = * PD] = APD 


OS a 


where the constant e defined by the formula 


(6.231) e = $, 

COS @ 
is called the eccentricity of the conic. The point F and the line L are 
respectively a focus and a directrix of the conic. It is clear from Figure 
6.22 that, whenever a and $ are given acute angles, we can make the dis- 
tance from F to L have any positive value p we please by taking r at the 
appropriate distance from VY. The equation (6.23) is called an intrinsic 
equation of the conic, that is, an equation that depends only upon the 
conic itself and not upon the coordinates of a particular “‘external”’ 
coordinate system. 

In case 8 = a, the conic is called a parabola. In this case, (6.231) 
shows that ¢ = l and that the formula (6.23) reduces to the simpler 
formula |PF| = |PD|. Thus we have, as was promised in Section 1.4, 
proved that a parabola is the set of points P (in a plane) equidistant from 
a fixed point (the focus F) and a fixed line (the directrix L). 

In case a < B < 2/2, the conic is called an ellipse. In this case 


(6.24) IPF) = e|PD\, 


where the eccentricity is a constant e for whichO < e <1. As we can see 
from Figure 6.22, an ellipse is an oval (or oval curve) that lies entirely 
on one nappe of the cone. Ellipses will be studied in greater detail in 
Section 6.3. 

In case 0 < B < a, the conic is called a hyperbola. In this case 


(6.241) IPF| = elPD\, 


where the eccentricity is a constant efor which e > 1. As we can see from 
Figure 6.22, a hyperbola consists of two branches (or parts) one of which 
is contained in each nappe of the cone. Hyperbolas will be studied in 
greater detail in Section 6.4. 

The above information enables us to find the equation of a nontrivial 
conic K (parabola, ellipse, or hyperbola) which lies in an xy plane when we 
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know the eccentricity e, the coordinates 
(x1,y1) of a focus F, and the equation 
Ax + By + C =0 of a directrix L. While 
different values of e yield conics of different 
shapes, the schematic Figure 6.25 may be 
Figure 6.25 helpful. The formula [PEI = e|PD| is is saul. 
alent to the formula |FP|? = e|PD|?, a 
use of formulas for distances from points to points and from points t to 
lines (see Theorem 1.48) enables us to put this in the form 


2 
(6.251) (x = #1)? + (y = y1)? = Gr ppa (4x + By + C). 


While the equation (6.251) has its virtues, we can obtain a more inform- 

ative equation by choosing the x,y coordinate system in such a way that 

the focus F lies on the x axis and the 

directrix is perpendicular to the x axis. 

D P(x,y) With the intention of so determining x, 

that the resulting equation will have its 

simplest form, we suppose that the focus F 

x=x,-p F(x;,,0) x has coordinates (%,0) and that the direc- 

Figure 6.26 trix lies a given positive distance # to the 

left of the focus (as in Figure 6.26) so 

that the equation of the directrix is x = x, — p. The intrinsic equation 
\FP|? = e?|PD|? then gives the coordinate equation 


(x — #1)? + y? = elx — (m1 — p)? 


Or 
(6.261) (1 — e)a? + Uel — p) — wile + y? = el — p)? — x}. 


We can now begin to see how the nature of the equation depends upon the 
eccentricity e. In case e = 1, so that the conic is a parabola, (6.261) 
reduces to 


1 
(6.262) x= Ip y2 + (= — 2) 


This equation has its simplest form when xı = ~/2. The simplest equa- 
tion has the form x = ky? which (except that the roles of x and y were 
interchanged to simplify matters) was studied in Section 6.1. 

We now face the task of simplifying (6.261) for cases in whichO < e < 1 
or e > l and the conic is an ellipse or a hyperbola. As is easy to guess, 
the greatest simplification results from so choosing xı that the coefficient 
of xis0. Therefore, we let xı be determined by the equivalent equations 


(6.27) e(x1 — p) — xı = 0, x, = —*p xı— p = —?P 


1 —e 
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Putting this value of xı in (6.261) gives the equation 


(6.271) (1 — 2%)? + y? = e*p? 


l — 2? 


This equation of the conic is a good source of information, but we obtain 
a better source by putting the equation in the ‘“‘standard form.” To do 
this neatly and correctly, we sacrifice some paper to put (6.271) in the 
form 


(1 — e?)x? y2 _ e?p? 
l + 1 T-@2 


3 


divide the numerator and denominator of the first term by (1 — æ?) to 
obtain 


x rL 
n a e 


and then divide both members of this equation by the right member to 
obtain an equation which is put in the form 


(6.272) 4+ Y= 1, (ellipse) 


when 0 < e < 1, and in the form 


x2 y? 
(6.273) “ep — PZ = 1, (hyperbola) 


(e? — 1} ¢? —] 
when e > 1. Everything is so arranged that the denominators in (6.272) 
and (6.273) are positive. 

Unless we think a bit about sections of cones, we cannot fully appreci- 
ate the significance of these formulas. It is sometimes said that any 
reasonably sane person should feel quite sure that an ellipse is an egg- 
shaped oval which has a “small end” at the part of the ellipse nearest the 
vertex of the cone and which has a “big end” at the part of the ellipse 
farthest from the vertex of the cone. However, (6.272) shows very 
clearly that the x and y axes are axes of symmetry of the ellipse and that 
the origin is a center (center of symmetry) of the ellipse. Thus (6.272) 
reveals the astonishing fact that the ellipse has a center and that the two 
“ends” of the ellipse are alike (or congruent). Similarly, suppose a 
particular hyperbola H intersects one nappe of a cone at some points 
near the vertex of the cone but intersects the other nappe only at points 
very far from the vertex of the cone. It would seem to be incredible that 
the two branches of this hyperbola should be alike, but they are alike. 
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The formula (6.273) shows that the x and y axes are axes of symmetry of 
the hyperbola and that the origin is a center (center of symmetry) of the 
hyperbola. This proves that the two branches of the hyperbola are 
congruent. 

There are two reasons for putting the above equations in the simpler 
standard forms 

2 2 

(6.28) AtS, Xl, 
where a and b are positive constants. In the first place the denominators 
in (6.272) and (6.273) are clumsy things to write, and in the second place 
equations of the form (6.28) often arise in problems where a and b are not 
children of eccentricities. Comparing (6.272) and (6.273) with (6.28) 
shows that a and b are determined in terms of e and p by the formulas 


6.281 = eP ? b = — P _ 
(6-281) "=E e] VIl — e] 


On the other hand, the formulas 


2 
(6.282) laj = Toa = ae = distance from center to focus 


(6.283) |x. — pl = TA = = = distance from center to directrix 
(6.284) ae = a? — b? for ellipse, ae = V/a? +b? for hyperbola 


serve to determine other quantities in terms of a and b. The first two of 
these formulas are obtained very quickly by comparing the formulas for 
xı and xı — > in (6.27) with the formula for a in (6.281). To obtain 
(6.284), we can square the members of (6.281) and combine the results to 
obtain 5?/a? = |1 — e?| and then treat separately the cases in which 
O<e<lande>1. Observe that 0 <b <a when 0 <e¢ <1 and 
also L < e < +/2 but that b > a when e > +/2. Graphs of ellipses and 
hyperbolas, and schemes for remembering essential parts of the above 
formulas, will appear in later sections. 


Problems 6.29 


I Copy Figure 6.25 and the equation (6.251) of the conic K and look at 
them. Then show that the equation of K can be put in the form 


o (1-a Tp) = frp t (1 Faw)” 


~2(s:+ ie) -2(91+ e?BC ae y = oa 


A? + B? A? + B? + B? — (x it yi): 
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This equation is ponderous and nobody should ever dream of remembering it, 
but it is useful. 

2 With the aid of the result of Problem 1, show that the coefficient of xy 
in the equation of a conic K is zero if and only if each directrix of K is parallel 
to one of the coordinate axes. 

3 With the aid of the result of Problem 1, show that if the coefficients of 
x? and y? in the equation of a conic K are both 0, then e = \/2 and 4? = B? 
Show that when ¢ = 4/2 and B = 4, the equation reduces to 


sity C 


(2) syt (at $)e t (n+ E) = 7 YA 


Show that when ¢ = +/2 and B = — A, the equation reduces to 


(3) ny — (ati) -b-h -5-4 


The graphs of these equations are hyperbolas because ¢ = 1/2 > 1. 
4 With the aid of Problem 3, show that if e = 1/2, B = A, xı = —C/A, 


and yı = —C/ 4A, then the equation of the conic K is 
C? 
(4) xy = 7J 


Show also that if e = 4/2, B = — 4A, x, = —C/A, and yı = C/A, then the equa- 
tion of the conic Ķ is 


C2 
(5) xy = -7J 


Our equations are now much simpler. 

5 Let k >0. Observe that formula (4) of Problem 4 reduces to xy = k 
when 4 = l and C = — +/2k. This shows that xy = k is the equation of the 
conic having a focus at the point (v/2k, V2k), having a directrix with the 
equation 


(6) x+y — VS2k =0, 


and having eccentricity e = 1/2. Remark: Relatively few persons have enough 
courage to undertake to sketch or otherwise describe a cone in Es which intersects 
the xy plane in the graph of the equation xy = k and then use methods of syn- 
thetic geometry (geometry which, unlike analytic geometry, never uses algebra 
and other brands of mathematical analysis) to obtain information about the foci 
and directrices of the conic. However, the results of this problem enable us to 
put this information in very simple terms. Supposing that k > 0, we start with 
a good clean x, y coordinate system and show how to locate the foci and directrices 
of the hyperbola having the equation xy = k. The point V(«/k, vk) clearly 
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lies on the hyperbola, and we start by 
locating it in Figure 6.291. We then 
enter the wholesale sketching business 
and sketch the line M containing O and 


S = A Fi(V2k, ¥2k) y , the circle C with center at O which con- 

xe JE vB tains VY, and the square S whose sides are 

Iz | X tangent to C at the points where C inter- 
sects the coordinate axes. 


The points 

Fy (+/2k, 2k) and F(— VZk, — VIR) 

where the square S meets the line M are 

=f2k the foci of the hyperbola. The line L, 

joining the points where the circle and 

square meet the positive x and y axes is 

Figure 6.291 one directrix of the hyperbola. The line 

L: joining the points where the circle and 

square meet the negative x and y axes is the other directrix of the hyperbola. 

Finally, we complete Figure 6.291 by sketching the hyperbola. As we know, the 

coordinate axes are asymptotes of the hyperbola The origin is the center (the 

center of symmetry) of the hyperbola. The line M is a line of symmetry; it is 

called the transverse axis of the hyperbola. The line through O perpendicular to 

M is another line of symmetry; it is called the conjugate axis of the hyperbola. 

The particular hyperbola we have been studying is called a rectangular hyperbola 
because its asymptotes are at right angles to each other. 

6 Information theory teaches that results like those of Problem 5, which 
depend upon considerable calculation, should be checked when it is relatively 
easy to do so. Letting F be the point (/ 2k, ~/2k), letting L be the line having 
the equation x + y — »/2k = 0, letting D be the foot of the perpendicular from 
P to L, and letting e = +/2, simplify the intrinsic equation |FP|? = e?|DP|? to 
obtain the coordinate equation xy = k. Remark: We are seldom required to 
find the distance from a point to a line which is not parallel to a coordinate axis, 
and it is helpful to be able to find Theorem 1.48, which enables us to obtain the 
result very quickly. 

7 Find the set of numbers k such that there exists at least one point (x,y) 
whose coordinates satisfy the equation y = kx and the equation 


(2)? +y =l (b) xy = (e) xy = —1 
2 2 2 2 


In each case, sketch a figure which shows the geometric significance of the result. 
8 Tell why a circle is not an ellipse. Remark: The answer must be based 
upon definitions and not upon intuitions of the untutored. 
9 When 4 = 0 and B > 0, the graph of the equation 


y? = Ax? + B 


is a central conic (circle or ellipse or hyperbola but not a parabola or a degenerate 
conic) having its center at the origin. Show that if m and b are constants such 
that the line having the equation 


y = mx +b 
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intersects the conic at two points, then the mid-point of the chord of the conic 
joining these points has coordinates (x,y), where 


op P” __ _ m +24 — m’) 
* = 7G ny ISDA nA 


Use this result to show that, for each fixed m, the set D of mid-points of chords 
having slope m lies on a line through the center of the conic. Remark: This set 
D, which is always a line segment when the conic is a circle or an ellipse and is 
sometimes a whole line when the conic is a hyperbola, is called a diameter of the 
conic. 

10 Let m bea positive constant. A surface S contains the origin, and when 
y # 0, it contains the circle which lies in a plane parallel to the xz plane and has 
a diameter coinciding with the line segment joining the two points (0,y,0) and 
(0,y,2my) in the yz plane. Sketch a figure showing S and show that the equation 
of S is 


x? + (z — my)? = my? 


Remark: One who wishes to rise above minimum requirements may show that 
(i) S is a quadric surface, (ii) S is a cone and hence is a quadric cone, (iii) S is 
not a right circular cone. One who wishes to rise to still greater heights may 
try to decide whether we know enough to determine whether the cone has an 
axis and, if so, whether sections made by planes perpendicular to this axis are 
ellipses. 


6.3 Ellipses Remarkable geometric properties of ellipses can be 
extracted from Figure 6.31. This 
figure, like Figure 6.22, shows a 
cone having a vertical axis. The 
axis lies in the plane of the paper. 
The plane r intersects the cone in 
an ellipse E of which the two points 
(vertices, in fact) VY; and F> lie in 
the planeof the paper. Thesmaller 
circle represents a sphere which is 
tangent to the cone at the points of 
a circle which determines the plane 
rı and is tangent tom atFı. Aswe 
saw in Section 6.2, mı and r inter- 
sect in a line L; and, moreover, F is 
a focus and L; is a directrix of the 
ellipse #. The larger circle repre- 
sents a sphere which is tangent to 
the cone at the points of a circle Figure 6.31 
which determines the plane r: and 

is tangent to r at Fə. The planes rz and r intersect in a line Le, and the 
same procedure which was applied to F; and L, shows that F, is another 
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focus and L» is another directrix of the ellipse E. Thus & has two foci 
and two directrices. 

We can look at Figure 6.31 and make some informal observations that 
we shall not (and perhaps cannot) make precise. If V,and Fare nearly 
equidistant from V, then the ellipse is nearly circular, the foci are close 
together and nearly midway between Ñ, and V2, and the eccentricity e is, 
as (6.231) shows, nearly zero. If we keep V; where it is and replace V, by 
a point many miles up on the cone, then the eccentricity will be near 1, 
the ellipse will be relatively flat, and the part of the ellipse within a few 
miles of ¥; would look so much like a part of a parabola that very careful 
inspection of this part would be required to enable us to tell whether the 
conic is an ellipse or a parabola or a hyperbola. 

Our next step is to use Figure 6.31 to obtain the famous string property 


(6.32) |F,P| + |FoP| = |77 


of the ellipse £, which shows that the sum of the distances from the foci 
of an ellipse to a point P on the ellipse has the same constant value for all 
points P on the ellipse. Let P be VY; or V2 or any other point on the 
ellipse. The line VP lies on the cone and is tangent to the lower and upper 
spheres at points 4; and 42. Then, as was pointed out in Section 6.2, 
|PF;| = |P] because the two vectors have their tails at the same point 
and are tangent to a sphere at their tips. Also, |PF2| = |P4,| for the 
same reason, the upper sphere now being involved. Therefore, 


(6.33) IFP] + |FLP| = [AP|\ + [PA = (Aa. 


Wherever the point P may be on the cone, the number [4,4] is the con- 
stant slant height of the segment of the cone that lies between the parallel 
planes mı and r2; in fact if d is the distance between m, and rə, then 
IZA] = d/cos a, where a is the angle at the vertex of the cone. The 
points V;, Fi, F2, and F2 all lie on the line in which v intersects the plane 
of the paper. The results of setting P = V;, and then P = Vz, in (6.33) 
give 


? 


d > => d 
? |F a] + |F: = 


COS & COS @& 


(6.331) |F Fl + IHA = 
and with the aid of Figure 6.31 we can put this in the form 


-a > —— > d 
(6.332) 2| PF] + IFF = |FiF ol + 2E = — 


COS @& 


This gives the remarkable fact that 


(6.333) AA = IFF a]. 
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With the aid of Figure 6.31 and these formulas, we find that 
(6.334) IPP = PF] + Fil + [FeV 


= UF | + Fil = —— = [ZA 
COS a@ 

From this and (6.33) we obtain the string property (6.32). One reason 
for interest in the string property of ellipses lies in the fact that it provides 
a mechanical method for drawing ellipses. Let a string of length 2a have 
its ends pinned totwo points Fand F,on a sheet of paper. Using a pencil 
point to stretch the string into two straight segments, we can move the 
pencil so that its point draws an ellipse having foci at Fı and F» as the 
string slides over the pencil point. 

Figure 6.34 shows an ellipse which was drawn with the aid of the string 
property, and it also shows some numerical dimensions which display 


Figure 6.34 


information from the paragraph containing (6.281). Even though the 
equation of the ellipse of the figure has already been derived, it is worth- 
while to know about the operations involved in using the intrinsic string 
property |F,;P| + |F2P| = 2a to derive the equation. Letting Fy(—ae,0) 
and F,(ae,0) be located on the x axis with the origin midway between them 
as in Figure 6.34, we use the string property to obtain the uninformative 
equation 


(6.35) V(x + ae)? + y? + V(x — ae)? + y? = 2a, 


which should be simplified. If we square the members of this equation; 
the product of the two square roots will complicate our calculations. It 
is better to transpose one of the square roots (we select the second) and 
square and simplify the result to obtain 


V(x — ae)? + y? = a — ex. 
Squaring and simplifying this gives 
(1 — e) + y? = a®(1 — e), 
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and dividing by the right side gives the standard form 


2 2 
(6.36) +5 =1, 
where b? = a?(1 — e?) or 


(6.37) ae = Væ — bh. 


It is of interest to see that memorization of only a few details enables us 
to find the graph, foci, and directrices of the graph of (6.36) when a and b 
are given constants, say a = 5 and b = 2. Putting y = 0 shows that 
points (2,0) and (—a,0) lie on the graph. The line segment joining these 
points is the major axis of the graph. Putting x = 0 shows that the 
points (0,b) and (0,—b) lie on the graph. The line joining these points 
is the minor axis of the graph. The graph is an ellipse through these four 
points. The foci always lie on the major axis. With or without the aid 
of the string property of the ellipse, 
we can remember that the foci lie on 
the major axis and on the circle of 
radius a having its center at an end of 
the minor axis. Then Figure 6.38, 
which is a simpler version of Figure 
6.34, shows that the distance from the 
center of the ellipse to the foci can be 
calculated by the Pythagoras theorem. 
Figure 6.38 The distance is ya? — b?, and if we 

will remember that this is ae, then we 
can calculate e. Finally, we can calculate the distance from the center to 
the directrices if we remember that this distance is a/e. The numbers 
ae and a/e are the key numbers. 


Problems 6.39 


1 For each of the following pairs of values of a and b, sketch the ellipse 
having the equation 


x? y? 
ate) 


find the eccentricity, find the foci (give coordinates), and find the directrices 
(give equations). Try to cultivate the ability to use the Pythagoras theorem and 
key numbers without use of books or notes. Check the numerical results by use 


of the fact that the distance p from a focus to its directrix must satisfy the equa- 
tion ¢?p? = 52(1 — e°”). 


(a) a=5,b=2 (b) a=3,b=1 (c)a=5,b=4 
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2 The equation 
x? y? 
zt 33 =I 


differs from equations of ellipses having their foci on the x axis because the 
denominator under x? is not greater than the denominator under y2. Never- 
theless, plot the four points on the graph obtained by setting x = 0 and then 
y = 0, and then sketch the graph. Observe that everything is like the preceding 
problem except that the roles of x and y are interchanged. Then proceed to 
find the eccentricity, foci, and directrices. Repeat the process when 2 and 3 
are respectively replaced by 


(a) 2 and 5 (b) land 5 (c) 3and 5 
3 We have known for a long time that the graph of the equation 
(æ = A)? + 9 = A)? = a 


is a circle having its center at the point (h,k). With this hint, sketch graphs of 
the equations 

— 2 — 2 — 2 — 2 
SEM, oD pegea 
Observe that, in these cases, distances from centers to foci are not coordinates 
of foci; suitable adjustments must be made. Remark: A good clean start is 
made by setting y = 2 and calculating x — 1 and then x. 

4 Find the equations of the ellipses (if any) which have foci at the points 
(—2,0) and (2,0) and which pass through the point (1,1). 

5 Find the equation of the ellipse which has its center at the point (2,3), 
which has axes parallel to the coordinate axes, and which is tangent to the coordi- 
nate axes. Sketch a reasonably good figure. 

6 The foci of a particular ellipse lie midway between the center and vertices. 
Find the eccentricity. Supposing that the major axis has length 2a, find the 
length of the minor axis and the distance from the center to the directrices. 
Sketch a reasonably good figure. 

7 Except for minor perturbations, the orbit of the earth is an ellipse having 
the sun at a focus. The least and the 
greatest distances from the earth to the Figure 6.391 
sun have the ratio $$. Find the ec- 
centricity of the approximate orbit. 
Ans.: Bg. 

8 As in Figure 6.391, let P:(x1,41) be 
a point on the ellipse having the stand- 
ard equation 


T, 


Pi(%9) 


x? y? 
atp 


1. 


Supposing that P, is not one of the 
points where the ellipse intersects the 
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coordinate axes, find the equation of the line T, tangent to the ellipse at 


Py. Ans.: 


xı% yy 
at pe st 


9 Find the coordinates of the points where the tangent Tı of Problem 8 
intersects the coordinate axes. Ans.: 


2 2 
G9) e3) 
x} yı 


10 Find the coordinates of the points where the tangent Tı of Problem 8 
intersects the lines through the foci perpendicular to the major axis. Ans.: 


(et) (20-2) 


11 Find the coordinates of the points where the tangent Tı of Problem 8 
intersects the directrices of the ellipse. Ans.: 


_ 2 b? (1 4 a) (5 b? (1 — 2). 
e yı ae e yı ae 

12 Let the line T, tangent to an ellipse at P, intersect a directrix at Qı and 
let F be the focus corresponding to the directrix. With the aid of Problem 11 
and the fact that b? = a?(1 — e?), prove that the line FQ; is perpendicular to the 
line FP,. Remark: This result has some quite surprising consequences. If the 
focus F, directrix D, and one single point P(«1,y1) of an ellipse are marked in a 
plane, we can give a simple rule for drawing the line T which is tangent to the 
undrawn ellipse at P;. In case P; is on the 
line through F perpendicular to D, the 
tangent 7; is the line through P, parallel to 
D. Otherwise, the tangent T, is the line 
containing P, and the point Q, where the 
line through F perpendicular to the line FP; 
intersects the directrix. This result implies 
that if Pı and P are points at the end of a 
focal chord (a chord containing a focus), then 
the tangents at Pı and Pz intersect at the point 
Qı on the directrix where the line through the 
focus perpendicular to the line P\P2 meets the 
directrix. Figure 6.392, in which a part of 
the ellipse is drawn, illustrates this elegant 

geometric fact. 
Figure 6.392 13 Figure 6.392 illustrates another in- 
teresting geometric fact. Then line OP, 
and the line through F perpendicular to the tangent T, at Pı intersect at a point 
A on the directrix. Prove the fact by proving that each line intersects the direc- 

a ayy 


trix at the point 4 (£ — }. 
l exi 
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14 Figure 6.393 illustrates the fact that the line XP, from a vertex V; to 
a point P, on an ellipse is parallel to the line OE from the center of the ellipse to 
the point E where the tangent at P, intersects the tangent at the other vertex V2. 


. b? 
Show that the coordinates of E are (a, zı (1 — “1)) and prove the fact. 


F (ae,0) (¢. 0) 
e 
Figure 6.393 Figure 6.394 


15 Figure 6.394 shows a part of the ellipse having, as usual, the standard 
equation 
x? y? 

(1) ait 52 


in whichO <b < a. Let G be the point in the first quadrant where the ellipse 
is intersected by the line through the focusF parallel to the directrix. Prove that, 
as the figure shows, the coordinates of G are (ae, b?/a). Remark: The result can 
be obtained by use of the fact that the y coordinate of G is the product of e and 
the distance (a/e — ae) from G to the directrix. It can also be obtained by put- 
tingx = aein (1). Ineach case, it is necessary to use the relation b? = a2(1 — ¢?). 

16 Using the notation and results of Problem 15, show that the equation of 
the tangent to the ellipse at G is ex + y = a. Remark: This shows that the 
tangent intersects the y axis at the point (4,0) and intersects the x axis where 
the directrix does. These results are illustrated in Figure 6.394. The circle 
which has its center at the center of the ellipse and contains the vertices is called 
the major circle of the ellipse. Thus the tangent ai G intersects the major axis 
where the directrix does and intersects the minor axis where the major circle does. 
This is one of many elegant geometric theorems that have fascinated men for 
centuries. 

17 Supposing that P(x,y) is a point on the ellipse having the standard equa- 
tion (6.36) which we should now know, use the information in Figure 6.38 (which 
we should remember) and the distance formula to show that 


a? + Jato biz a? — /a — Bex 
3} ———— + 
a 


|F,P| = IFP] = 


Then seek a way in which formulas involving eccentricity can be used to obtain 
the same result. Note that |F,P| + |F2P| is what it should be. 
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18 Study Figure 6.395 and discover the procedure by which the encircled 
points are determined, and then use the procedure to obtain another encircled 
point. Prove that the set of points obtained 
by this procedure lie on an ellipse. Solution: 
If the inner and outer circles have radiuses 
(this time we use English; the Latin is radii) 
a and b, and if a line is drawn through the 
origin making the angle 6 with the positive x 
axis, then the coordinates (x,y) of the resulting 
encircled point are 


(1) x = a cos 8, y = b sin 8. 


From these equations we obtain 


x2 ay? . 
(2) z t Re = cos? 8 + sin? @ = 1, 
Figure 6.395 a 


so the point (x,y) lies on an ellipse. 
19 Let0 < b< aand let w>0. Let a particle move in a plane in such a 
way that, at each time ż, the vector running from the origin of an x, y coordinate 
system to it is 


r = (a cos wt)i + (b sin wi)j. 


Show that its path is an ellipse. Show that it is always accelerated toward the 
origin and that the magnitude of the acceleration is 


nrn 
w? 4b? + (a? — b?) cos? wi. 
Hint: To get started, let x = a cos wt, y = b sin wt, and observe the result of 


some dividing and squaring and adding. 
20 <A point P moves around an ellipse having foci 


(1) F\(— Va? — b?, 0), Fa(V a — b?, 0) 


in such a way that the vector r running from the origin to P at time ż is 


(2) r = (a cos t)i + (b sin 4)j. 

Show that the vector 

(3) = —(a sin t)i + (b cos t)j 

is a forward tangent to the ellipse at P. Show that 

(4) F,P = (a cos t + A V/a? — b) i + b sin j, 

where A = 1 when k = 1 and X = —1 when k = 2, and then show that 
(5) |F P|? = a? + 2ad v/a? — b? cos t + A2(a? — b?) cos? t 

and hence 


(6) IFPI =a +À Va — b? cost. 
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Show that 
(7) [v] = Vb + (a? — b?) sin? t. 
Show that 


(8) F,P-v = —A Va — b? sin tla + AX Va — b? cos il. 


Letting x be the angle which the vector FP makes with the forward tangent V 
at P, show with the aid of (6), (7), and (8) that 


FP _ —\ Va — b sint 

IF.P\lv| v/b? + (a? — 8?) sin? t 

Show that y = b sin ¢ and that multiplying the numerator and denominator of 
the last member of (9) by b gives the formula 


-A Va — By 
VETE 
Remark: These remarkable formulas yield the famous reflection property of 
ellipses. Since A = 1 when k = 1 and A = —1 when & = 2, the numbers 
cos ġı and cos ¢z in (10) differ only in sign. This implies that the vectors 
FP and FP make supplementary angles with the forward tangent v to the ellipse 
at P and hence that the lines F\P and FP make equal angles with the normal to 
the ellipse at P. This implies that if light or something else goes in a line from 
F, and is reflected from the ellipse in such a way that the angle 6, of reflection 
is equal to the angle 0, of incidence, then its path leads to F}. Moreover, because 
of the string property of the ellipse, radiation leaving F, at the same time but in 
different directions will arrive simul- 
taneously (or in phase) at F2. 

21 Thereflection property of ellipses 
is a consequence of another interesting 
geometric property of ellipses. Let the 
line T of Figure 6.396 be tangent at P 
to the ellipse having foci at Fı and Fo. 
Let H, and H.: be the reflections in T 
of F, and Fz; this means that T is the 
perpendicular bisector of the line 
segments FıHı and FH Then, as 
indicated by the figure, the line seg- 
ments F,H: and FH, intersect at P. 
To prove this fact, let 4 be any point Figure 6.396 
on T different from P and let B be the 
point at which the line segment F4 intersects the ellipse. Then (why ?) 


(9) cos Ọk = 


(10) COS Pk = 


(1) (F| + [BF] < [FA] + [AF 
so (why?) 

(2) [FPI + PF: < [F4] + [AF 
and (why?) 


(3) IF| + IPF: < [F2] + ZE. 
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Hence (why?) P must lie on the line Fi. Similarly (why?) P must lie on the 
line F2H;. This proves that the lines FiH2 and F2ff; intersect at P. Therefore 
(why?) the angles FıPG, and F2PG2 are equal. 

22 This problem involves a nested family of curves containing many ellipses 
and one circle. Let a bea given positive number. For each positive number b, 
the graph of the equation 


x? y? 
ato} 


is an ellipse (or a circle) which intersects the x axis at the points (—a,0) and 
(2,0). It is easy to generate interest in these graphs by sketching some of them. 
When 0 < x; < a, the line having the equation x = x; intersects each of these 
graphs at two points. Prove that the tangents to these graphs at these points 
all intersect at a point on the x axis. Solution: Using the result of Problem 8 
(or working out the result again) shows that each tangent intersects the x axis 
at the point (a?/x,, 0) which does not depend upon b. 

23 For each @ for which 0 < 0 < 1/2 and 0 = w/4, the graph of the equation 


x? y? 
sin? 6 T costo} 
is an ellipse. Sketch several of these graphs. 

24 The members of a family of confocal ellipses have foci at the points 
(—1,0) and (1,0). Sketch good approximations to six of them. Suggestion: 
Do not work too long on an easy problem. Select a point (0,5) and make Figure 
6.38 tell you what the a in (a,0) must be. 

25 Supposing that an ellipse E is given, tell how our little sister can use her 
new drawing equipment to locate the center, the axes, and the foci of E. Hints: 
The mid-points of parallel chords of E lie on a line through the center C of E. 
It is easy to choose r such that the circle of radius r having center at C intersects 
the ellipse in four points. 

26 A rod of length L has a red end, a blue end, and a pink dot P at distance 
q from the red end. Suppose that O < g < L. Show that if the red end is on 
the x axis and the blue end is on the y axis, then (except when g = $) P must lie 
on an ellipse. Ans.: The equation of the ellipse is 


x2 y? _ 
Go-gteg-' 


27 Let F be a point which is inside a circle C but is not the center of C. A 
Figure 6.397 little preliminary sketching shows that the set S of points 
equidistant from C and F looks much like an ellipse 

having a focus at F. What are the facts? Solution: As 

in Figure 6.397, let Fı be the center of the circle and let 


Po F: be the point F. The condition that P be equidistant 
© © from C and F, can be put in the form 
F OF; 
(1) r — |F,P| = [FP] 


where r is the radius of the circle. Before undertaking 
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to express this result in terms of coordinates, we can suddenly realize that it 
is almost familiar. If we put it in the form 


[FP] + [FP] = 7, 


we see an expression of the string property of an ellipse. Therefore, S is an 
ellipse having foci at the center of the circle and the given point F. 

28 Sketch a figure like Figure 6.31 in which the distance from V to V2 is 
about 10 or 20 times the distance from V to V;. Try to decide whether the 
center of the ellipse is on the axis of the cone. 

29 Remark: Figure 6.31 presents an interesting problem in plane geometry. 
When two lines through V and a line r are given, we can use a ruler and compass 
to construct the circles of the > figure. We can then wonder whether we can give 
a simple proof that |X F| = |F2/.| without use of the cone and planes and spheres 
that were employed in the proof in the text. Perhaps consideration of this 
problem will increase our respect for the methods that were employed. 


6.4 Hyperbolas Geometric properties of hyperbolas can be extracted 
from Figure 6.41, which, like some 
preceding ones, shows a cone hav- 
ing a vertical axis. The axis lies in 
the plane of the paper. The plane 
x intersects the cone in a hyperbola 
H of which the two points X, and 
Va (vertices, in fact) lie in the plane 
of the paper. The upper circle re- 
presents a sphere, in the upper 
nappe of the cone, which is tangent 
to the cone at the points of a circle 
which determines the plane r, and 
is tangent tom atFı. As we saw in 
Section 6.2, mı and r intersect in a 
line Lı and, moreover, F; is a focus 
and Lı is a directrix of the hyper- 
bola H. The lower circle repre- Figure 6.41 
sents a sphere, in the lower nappe of 

the cone, which is tangent to the cone at the points of a circle which 
determines the plane m: and is tangent to m at Fz. The planes r2 and r 
intersect in a line Lz, and the same procedure which was applied to F, and 
Lı shows that F is another focus and L is another directrix of the hyper- 
bola H. Thus H has two foci and two directrices. 

Hyperbolas have a string property which involves the difference (not 
sum) of the distances |F,P| and \F,P| from foci to points on the hyperbolas. 
Let P be F, or any other point on the upper branch of H. The line VP 
lies on the cone and is tangent to the lower and upper spheres at points 
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A, and A;. With the aid of Figure 6.41 and familiar facts about vectors 
tangent to spheres at their tips, we see that 


(6.42) \F.P| — \F:P| = [ZP] — [4,P| = [Ao Ail. 


Wherever the point P may be on the upper branch of H, the number 
Z4: is the constant sum of the slant heights of two conical segments 
having their vertex at V and their bases in the planes rı and rs; in fact if 
d is the distance between the planes mı and me, then |424,| = d/cos a, 
where a@ is the angle at the vertex of the cone. In particular, letting 


P = V, shows that 


IFF l — [VF | = |4.A| 
and hence 
(6.421) [FV ol + VP — [PF] = |414. 


In case P lies on the lower branch of H, we can reverse the roles of the 
subscripts 1 and 2 to obtain the formulas 


FiP| — \FoP| = |Ai4l, |F — Fa = |434 

and 
(6.422) |El + IFP — (Wok ol = |414- 
Adding the formulas (6.421) and (6.422) shows that 2|V2V,| = 2|4,4,| 
and hence that the first of the formulas 
(6.423) |424| = WA IFF a = IRIA 
is valid. The second formula is a consequence of the first and (6.421). 
All this implies the string property of the hyperbola having foci at F, and 
F, and vertices at V; and V2. If P is on the hyperbola, then 
(6.43) FiP| — IFP] = |774, 
the plus sign being required when P is on one branch and the minus sign 
being required when P is on the other branch. 

Figure 6.44 shows a hyperbola and also some numerical dimensions 
Figure 6.44 


y 
Directrix Directrix~, 
K u PT 
\ 
H t 
F;(- ae,0) | -i IS ZN, | Fx(ae,0) x 
AK 
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which display information from the paragraph containing (6.281). Even 
though the equation of the hyperbola of the figure has already been 
derived, we become acquainted with useful ideas by using the intrinsic 
string property 


(6.45) FP] — |FP| = 2a 


to derive the equation. Letting Fi(—ae,0) and F,(ae,0) be located on the 
x axis with the origin midway between them as in Figure 6.44, we use the 
string property to obtain the equation 


v (x + ae)? + y? — V(x = ae)? + y? = Ra. 
Transposing the second term, squaring, and simplifying give 
+ V(x — ae)? + y? = a — ex. 
Squaring and simplifying this give 
(e — 1)x? — y? = ar(e? — 1), 
and dividing by the right side gives the standard form 


x? y2 


(6.46) L-z = l, 
where b? = a?(e? — 1), or 


(6.47) ae = V a + b. 


As was the case for ellipses, it is of interest to see that memorization of 
only a few details enables us to find the graph, foci, directrices, and asymp- 
totes of (6.46) when a and b are given constants. Putting y = 0 shows 
that the points Vi(—a,0) and V2(a,0) lie on the graph. The line through 
these points is the principal axis of the hyperbola, and the foci lie on it. 
Putting x = 0 shows that the hyper- 
bola contains no points on the y axis, 
but we mark the points (0,5) and (0, —5) 
anyway. The line through these points 
is the conjugate axis of the hyperbola. 
The next steps are to draw the box 
having horizontal and vertical sides 
containing the four points and then Figure 6.48 
draw the diagonals of the box. These 
diagonals are, as we showed in Problem 7 of Section 3.3 and shall quickly 
show again, asymptotes of the hyperbola. A reasonable approximation 
to the hyperbola can now be sketched very quickly; it is tangent to the 
box at the two vertices we have found, and a pencil point which traces a 
part of the hyperbola becomes steadily closer to the asymptote as it 
recedes from a vertex. For the part of the hyperbola in the first quad- 
rant, this result comes from the fact that 


b _b er ee É 
(6.481) a” z V= P= Jai 
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and for other quadrants we use symmetry. We can use the Pythagoras 
theorem to calculate the distance from the center to a focus if we remem- 
ber Figure 6.48, a simplified version of Figure 6.44, which shows that if we 
fix one point of a compass at the origin, then the circle through the corners 
of the box meets the principal axis at the foci. If we remember that this 
distance is ae (as it also is for ellipses), then we can calculate the eccen- 
tricity e. Finally, if we remember that the distance from the center to a 
directrix is a/e (as it also is for ellipses), we can calculate this distance. 
The numbers ae and a/e are still the key numbers. 


Problems 6.49 


1 For each of the following pairs of values of a and b, sketch the hyperbola 
having the equation 
x2 2 
r pa) 


find the eccentricity, find the foci (give coordinates), find the directrices (give 
equations), and find the asymptotes (give equations). Try to cultivate the 
ability to use the Pythagoras theorem and key numbers without use of books or 
notes. Check numerical results by use of the fact that the distance p from a 
focus to its directrix must satisfy the equation ¢2p? = 5?(e? — 1). 


(a)a=5,b=2 (6) a@=2,b=5 (c)a=b=1 


2 The equation 


differs from equations of hyperbolas having their principal axes and foci on the 
x axis because the roles of x and y are interchanged. Nevertheless, plot the 
points on the graph obtained by setting x = 0 and then y = 0 and then draw the 
helpful box and sketch the hyperbola. Then proceed to find the eccentricity, 


foci, directrices, and asymptotes. Repeat the process when 2 and 3 are respec- 
tively replaced by 


(a) 2 and 5 (b) S$ and 2 (c) l and 1 
Remark: The graphs of the equations 

x2? y2 x 

@ poh Roan 


are hyperbolas having the same helpful box and the same asymptotes. Each 
hyperbola is said to be the conjugate of the other. 
3 Sketch graphs of the equations 


(a) eo Ge (b) 072 (x = a i 


Remark: Good clean starts are made by setting y = 2 or x = 1 and remembering 
that squares of our numbers (which are always real numbers) are never negative. 
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4 The two vertices and one focus of a hyperbola are given. Describe a pro- 
cedure by which it is possible to draw the asymptotes without using equations. 

5 For basic studies of hyperbolas, we place the hyperbolas upon coordinate 
systems in such a way that their equations have the standard form 


x2 y2 
a p 


= 1. 


Supposing Pi(x1,1) is, as in Figure 6.491, a point on the hyperbola that is not a 
y Ti Yo, 


VN” 
arn 


Figure 6.491 


vertex, find the equation of the line T, tangent to the hyperbola at P}. Ans.: 


a? 2 


6 Find the coordinates of the points at which the tangent 7; of Problem 
5 intersects the coordinate axes. Ans.: 


2 2 
Z, 0) (0, — a ° 
x} yı 
7 Find the coordinates of the points where the tangent 7; of Problem 5 
intersects the lines through the foci perpendicular to the x axis. Ans.: 


2 2 
(e - 505) (m-E0-8)) 
yı a yı a 


8 Find the coordinates of the points where the tangent 7; of Problem 5 
intersects the directrices of the hyperbola. Ans.: 


a b #1)) (2, -=( - 2)) 
(-%-= 1+2)), e yı l ae 


9 Let the line 7, tangent to a hyperbola at P; intersect the directrix at Q; 
and let F be the focus corresponding to the directrix. With the aid of Problem 8 
and the fact that b? = a?(e? — 1), prove that the line FQ, is perpendicular to the 
line FP.. 
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10 Sketch a figure somewhat like Figure 6.491 and observe that theline L; 
from a focus F perpendicular to the line T, tangent to the hyperbola at P, and 
the line OP; from the center of the world to P, seem to intersect at a point on the 
directrix. Prove that each of these lines does intersect the directrix at the point 
(a/e, ay,/ ex). 

11 Figure 6.492 illustrates the fact that the line XıPı from a vertex of a 
hyperbola to a point P; on the hyperbola is parallel to the line OZ from the center 


Figure 6.492 


of the hyperbola to the point E where the tangent T; at Pı intersects the tangent 
. b2 

at the other vertex. Show that the coordinates of Æ are (a, 7, = — 1)) and 
1 


prove the fact. 
12 Figure 6.493 shows a part of the hyperbola having, as usual, the standard 
equation 
x2? y? 


(1) pol 


Let G be the point in the first quadrant 
where the hyperbola is intersected by the 
line through the focus F parallel to the 
directrix. Prove that, as the figure 
shows, the coordinates of G are (ae, b?/a). 
Remark: A relation among a, b, and e is 
needed. 

13 Using the notation and results of 
Problem 12, show that the equation of 
the tangent to the hyperbola at G is 
ex — y =a. Remark: This shows that 
the tangent to the hyperbola at G inter- 
Figure 6.493 sects the x axis (the transverse axis of the 

hyperbola) where a directrix does and 
intersects the y axis (the conjugate axis of the hyperbola) at a point on the circle 
which has its center at the center of the hyperbola and includes the vertices of 
the hyperbola. 

14 Let Pi(x1,y:) be a point on a hyperbola having the standard equation 
x?/a? — y?/b? = 1. Find the coordinates of the points at which the tangent 7; to 
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the hyperbola at P; intersects the asymptotes of the hyperbola. Ans.: The inter- 
sections with the asymptotes y = (b/a)x and y = ~—(b/a)x are respectively 


a?b —ab? 


( a*b ab? ) ( 
bxi — ayı bx; — ayı ' bxi + ay, bxi + om) 


Remark: Since (bxı — ay)(bx; + ayı) = b?xi — a?yj = a?b?, the denominators 
are all positive or all negative. 

15 Prove that if a line T, is tangent to a hyperbola at P;, then P; lies midway 
between the points at which Tı intersects the asymptotes of the hyperbola. 

16 Find the area of the triangular region bounded by the tangent T, and the 
asymptotes of the hyperbola of Problem 5. Ans.: ab. 

17 Substantial information about the geometry of the hyperbola having the 
standard equation 


(1) in 


is concealed in Figure 6.494. The figure shows the auxiliary rectangle whose 
vertices lie on the asymptotes, and also the asymptotes. As we know, the circle 


Figure 6.494 


with center at O and radius v'a? + b? intersects the x axis at the foci. The 
smaller circles have radii a and b. The line RS is tangent to the circle of radius 
b at the point (6,0) where this circle intersects the positive x axis. Upon the 
basic part of the figure which has been described, we can heap more construction. 
Let OT be a ray (or half-line) from the origin which makes with the positive x 
axis an angle @ for which O < @ < 1/2. Let £ be the point where the ray OT 
intersects the circle having radius a, and let Q be the point where the tangent to 
the circle at 4 intersects the x axis. Let B be the point where the ray OT inter- 
sects the line RS. The point P(x,y) where the horizontal line through B intersects 
the vertical line through Q lies on the hyperbola because 


% 100! _ ecg = l ? y _ IRB! _ ang = $ 
|OA| cos ¢ b JORI cos ġ 
and hence 
x? 2 1 — sin? @ 
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Aristotle told Alexander the Great that there is no royal road to geometry, 
but it is quite easy to see what we have done and it is even quite easy to see some 
of the implications of what we have done. We have proved the theorem given 
above in italics. We have described a simple ruler-and-compass procedure for 
construction of points P on the hyperbola. We can use the construction to 
produce 4 or even 40 points on the hyperbola, but we cannot thus produce all 
of the points on the hyperbola and “construct the hyperbola.” Our result 
enables us to write the vector r from the origin O to a point P on the hyperbola 
in the form 


r = asec ġi + b tan @j, 


but ¢ is an “eccentric angle” which is not the angle between r and i. Persons 
having inherent interest in geometry may, as an extramural excursion, consider 
the special case in which the ray OT is the part of the asymptote in the first 
quadrant. Considerable geometry is associated with the fact that, in this case, 
the point Q is the focus F. To emphasize the fact that consideration of these 
things need not be tedious, we observe that if tan ġ = b/a, then cos ¢ = 
a/~/a? + b, so |OO| = V/a + b? and hence Q must be F. 

18 A glance at Figure 6.494 suggests that the directrices may be the lines 
determined by the points at which the asymptotes intersect the circle containing 
the vertices. Prove that it is so. 

19 Find the distance from a focus to a directrix of a hyperbola having the 
standard equation x?/a? — y?/ba = 1. Ans.: b?/~/a? + 5% 

20 Let a hyperbola have the standard equation x?/a? — y?/b? = 1. Let F 
be the focus and let D be the directrix in the right half-plane. Let P;(x,) 
be a point on the hyperbola in the right half-plane. Show that 


|FP,| = ex: — a. 


Let 4 be the point in which the directrix D is intersected by the line through 
P, parallel to an asymptote. Show that 


[AP,| = ex. — a. 
21 The functions in the right members of the formulas 


. e — et ef + et 
sinh t = 7” cosh t = 7 


are used often enough to justify introductions of special names and symbols for 
them. They are called hyperbolic functions, and the A in sinh ż and cosh ¢ tells 
us that the functions are hyperbolic sines and hyperbolic cosines. Show that 
if x = a cosh ż and y = b sinh ż, then 


This shows that if a particle P moves in the xy plane in such a way that, for each 
t, the vector r running from the origin to P is 


r = (a cosh kt)i + (b sinh &2)j, 
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then P traverses a branch of a hyperbola. Calculate the acceleration a and 
show that a = k'r. 

22 Copy Figure 6.48 and let a particle P move along the right branch of the 
hyperbola in such a way that the vector r running from the origin to P is, at each 
time ft, 


(1) r = (a cosh #)i + (b sinh Dj. 
Show that the vector 
(2) v = (a sinh #)i + (b cosh Aj 


is a forward tangent to the hyperbola at P. Use the figure to show that, where 
As = +1 when k = 1 and Ay = —1 when k = 2, 


(3) F,P = (a cosh £ +A V/a? +B) i + (b sinh Dj 
and then, with the aid of the simple fact that 1 + sinh? ż = cosh? ż£, show that 
(4) [F P| = a? F B cosh t + Xa. 


Letting ¢, be the angle which the vector FP makes with the forward tangent 
y at P, show that 


[FP]: y 
[F.P] 


(5) |v] cos $, = = Væ + b? sinh t 


and hence that 
VEF Èy 
Vbi + (a? + b)y? 


Remark: The formula (6) shows that ¢; = ¢2 This means that the lines drawn 
from the foci to a point P on a hyperbola make equal angles with the tangent to 
the hyperbola at P. 

23 An ellipse and a hyperbola are confocal, that is, have the same foci. 
Prove that they are orthogonal where they intersect. 
Remark: One way to prove the result is to use the fact 
that if F, and F: are the foci and P is a point of intersec- 
tion of the ellipse and hyperbola, then the vectors FP and 
FP make equal angles with the tangent to the hyperbola 
at P and also make equal angles with the normal to the 
ellipse at P. The miniature Figure 6.495 can help us 
remember the fact. 

24 The members of a family of confocal hyperbolas Figure 6.495 

have their foci at the points F\(—1,0) and F,(1,0). 
Sketch good approximations to graphs of six of them. Suggestion: Do not work 
too long on an easy problem. Make a figure which tells you where the corners of 
the handy boxes must be, and then sketch hyperbolas at the rate of two per 
minute. Remark: There is a reason for knowing about these things. If the x axis 
is an impenetrable barrier except for an opening on the interval —1 S x $1, your 
curves are paths followed by particles of a liquid or gas (or perhaps by football 
spectators) that stream through the gap. 


(6) cos ¢, = 
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25 LetO0 <b <a. Prove that when A < 5%, the ellipse having the equation 
x? y? 
Z-A EAT! 


has its foci at the points (+ ~/a? — b?, 0). Prove that, when b? < N < a?, the 
hyperbola having the same equation has the same foci. 

26 Let H bea given hyperbola. Describe an elementary geometric procedure 
for locating the center, the axes, and the asymptotes of H. Hints: Take clues 
from the hints of Problem 25 of Problems 6.39. Then use the fact that if the 
equation of H has the standard form, then y = +b when x is the length of a 
diagonal of a square of which the sides have length a. 

27 We have seen string mechanisms for construc- 
tion of parabolas and ellipses, and we need not slight 
hyperbolas. Let a red string run from a knot K so 
that it passes below a tack F; to an end tied to a pencil 
point at P as in Figure 6.496. Let a white string run 
from the same knot K so that it passes below the tack 
F, and then under a tack F: to an end tied at P. Show 
Figure 6.496 that if P is held in such a way that both strings are 

kept taut, then P will trace a part of a branch of a 
hyperbola as the knot K is pulled away from F;. Hint: Justify and use the fact 
that if the red and white strings have lengths R and X, then 


FR| = R — |FiP| = 7 — |F F4 — |FiPI 


Fy, Fy 


at all times. 

28 Sound travels with speed s, and a bullet travels with speed b from a gun 
at (—h,O) to a target at (4,0) in an xy plane. Where in the plane can the boom 
of the gun and the ping of the target be heard simultaneously? Ans.: At points 
(x,y) for which 

Vette _Ve=m Fe uh 
= P FF 
or 


V(x + hy ty? = SA (æ — h)? + y? + 2gh 


where q = s/b and hence g = 1/M where M is the Mach number of the bullet. 
In case M < 1 andq > 1, there are no such points because the length of one side 
of a triangle cannot exceed the sum of the lengths of the other two sides. In 
case M = 1 and g = 1, the required points are those for which y = 0 and x 2 h. 
In case M > 1 (so the speed of the bullet is supersonic) and q < 1, the required 
points lie on the right-hand branch of the hyperbola having the equation 


a 
gh? WL — gt) 


29 We have seen that if S is the set of points equidistant from a line and a 
circle having its center on the line, then S; is the sum or union of two parabolas. 
We have also seen that if Sz is the set of points equidistant from a circle and a 
point inside the circle which is not the center of the circle, then Sz is one ellipse. 
Complete the story by showing that if S; is the set of points equidistant from a 
circle and a point outside the circle, then S; is half (one branch) of a hyperbola. 
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30 Let the angle 4,042 of Figure 6.497 be a 


given angle between 0 andr. Everyone start- 
ing study of calculus must know that it is very 
easy to give a ruler-and-compass construction of 
the arc 4142 of a circle having its center at O 
and of the line OX which bisects the angle 41042. 
We may be too busy with other affairs to learn 
how the result can be proved, but we should 
nevertheless know that it has been proved to be 
impossible to give a ruler-and-compass construc- Figure 6.497 
tion of points P, and P2 such that the lines OP, 

and OP trisect the given angle. Pappus of Alexandria, who flourished about a.D. 
300, trisected the angle by means of hyperbolas. Let H, be the branch near 4, of 
the hyperbola having eccentricity 2 for which 4; is a focus and OX is a directrix, 
and let P, be the point at which H; intersects the circular arc 4,4. A branch 
H, of a similar hyperbola having a focus at 42 intersects the circular arc at Pa 
The definition of eccentricity and the symmetry imply that 


|ZP:| = 2|P;D| = |P\P.| = 2|DP.| = |PzA. 


Thus the three chords 4;P;, PıP2, P242 have equal lengths, and it follows that 
the two lines OP; and OP» trisect the given angle 4,04. In modern mathe- 
matics, it is important to know why the Pappus construction does not provide a 
ruler-and-compass construction of P, and Pa. It is possible to produce hordes 
of points on the hyperbolas with rulers and compasses, but it is impossible to 
produce all of them. In particular, it is impossible to prescribe rules for ruler- 
and-compass construction of the particular points where the hyperbolas intersect 
the circular arc. This matter will again be brought to our attention in Problems 
10.19. 

31 Let P;(%1,y1) be a point on the rectangular hyperbola having the equation 


(1) x? — y? = g. 


In terms of xı and yı, find the coordinates of the point (x,y) at which the tangent 
to the hyperbola at Pı intersects the line through the origin perpendicular to this 
tangent. Finally, find an equation which x and y must satisfy by eliminating 
x, and yı from your equations and the equation xj — yj = a?. Ans.: The first 
two of the required equations are 


(2) not, ye - LD 
xi tyi xi tyi 
We can square and add, and then square and subtract, to obtain 
at aë 
K R E T ETE 
It is then easy to obtain the final answer 
(4) (x2 + y2)2 = a(x? — y?). 


Remark: The graph of (4) is called the lemniscate of Bernoulli. In Problem 5 
of Section 10.1 we shall find that its polar coordinate equation is p? = a? cos 2¢. 
Its graph appears in Figure 10.171. 


390 Cones and conics 


32 This long problem should be very easy. The problem is to read about 
graphs of equations of the form 


(1) Ax? + By? + Cx + Dy + £ =0 


and to verify correctness of each assertion that is not completely obvious. In 
case AB > 0, completion of squares enables us to put the equation in the form 


(2) A(x — xo)? + Bly — yo)? = K, 


where 4 and B have the same sign, and the graph is an ellipse or a circle or a 
single point or the empty set. In case 4B < 0, the equation (1) can be put in 
the form (2), where 4 and B have opposite signs, and the graph is either a hyper- 
bola (when K = 0) or a pair of intersecting lines (when K = 0). Incase 4 = 
B = 0, the graph is a line or the empty set (in case C = D = 0 and E #0) or 
the whole plane (in case C = D = E = 0). So far we have covered all situations 
except those in which one of 4 and B is zero and the other is not. The case in 
which 4 = 0 and B #0 being analogous, we suppose henceforth that 4 = 0 
and B = 0 and that (1) has been reduced to 


x? + Cix + Diy + £, = 0. 


In case D, ¥ 0, completion of a square gives the equation y — yo = K(x — x»)? 
and the graph is a parabola. In case Dı = 0, the graph consists of two vertical 
lines or a single vertical line or the empty set. The results may be summarized. 
Depending upon the values of 4, B, C, D, E, the graph of (1) may be a conic 
(an ellipse, a hyperbola, a parabola, a circle; two intersecting lines, a single line, 
or a single point) and it may be a set which is not a conic (two distinct parallel 
lines, a plane, or the empty set). Still more information is available. The 
equation (1) is said to have elliptic type when AB > 0 even though there are 
cases in which the graphs are circles or points or empty sets. Similarly, (1) is 
said to have hyperbolic type when AB < 0 and to have parabolic type when 4 
and B are not both 0 but 4B = 0. 

33 Let 4 and B be nonzero constants not both negative. The graph of the 
equation 


(1) Ax? + By? = 1 


is then a central conic K (a circle or ellipse or hyperbola) having its center at the 
origin O. When P;(%1,y1) is a point different from the origin, the equation 


(2) Ax x + By = | 


is the equation of a line L, which is called the polar line (with respect to the conic 
K) of the point P;. Moreover, the point P; is called the polar point of the line 
Lı. These polar points and lines are very important in some parts of mathe- 

matics, and we can start learning about them. 
Figure 6.498 If the point P; lies on the conic K, then L, the 
polar line of Pi, is the line tangent to K at Py. 
When P, is not on K, matters are much more 
interesting. Suppose first that Py lies “out- 
side” the conic K so that there exist two points 
Qı and R, on K such that the tangents to K at 
Qı and R, contain P}. Then, as Figure 6.498 
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can help us remember, the polar line of P, is the line Ly containing the two points Qı 
and R;. À standard simple proof of this result is as amazing as the result. If the 
coordinates of the point Qı on K are (x2,2), then the equation of the tangent to 


K at Qi is 

(3) Axx + Bysy = 1. 
The fact that Pı(x1ı,y1) lies on this line implies that 
(4) Axx, + Byoy, = 1. 


This implies that (2) holds when x = %2 and y = yg and hence that Qı lies on the 
line Lı. When we have scrutinized this sleight of hand closely enough to under- 
stand it, we can see that the same argu- 
ment proves that R; is on Lı. Thus we 
have given geometric interpretations to 
the polar line Lı of P, for the case in which 
P; lies on K and for the case in which P lies 
outside K. Our next geometric inter- 
pretation of the polar line L, of P is equal- 
ly applicable to the case in which P; lies 
outside the conic (Figure 6.499), the case 
in which Pı lies on the conic (Figure Figure 6.499 
6.4991), and the case in which P, lies 

inside the conic (Figure 6.4992). Let L be a line through P, which intersects the 
conic at two points Po(%2,y2) and P3(x3,y3). The tangents to the conic at P and 
P; have the equations 


(5) Axx + Byy = 1 
(6) Axsx + Bysy = 1. 


Except for the special line L which contains the center O, these tangents intersect 


Figure 6.4991 Figure 6.4992 


at the point P(x,y) for which the two equations (5) and (6) are both satisfied. 
Since P; lies on the line L containing P» and Ps, there exists a constant A such that 


(7) xı = Xo + A(xg — x2) = (1 — Alen + Axs 
(8) yı = y: + A (y3 — y) = (1 — A)ye + Ays 
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Multiplying (5) and (6) by (1 — A) and A, respectively, and adding give the 
relation 


(9) Axx + Byy = 1, 


which shows that the intersection of the tangents lies on the polar line L, having 
the equation (2). This is a remarkable geometric fact. Different lines through 
P, yield hordes of pairs of tangents, and all of the intersections lie on the same 
polar line Lı. We shall not undertake 
to prove the fact, illustrated by Figure 
6.4993, that pairs of chords as well as pairs 
of tangents intersect on the polar line L.. 
When the lines L and L’ through P; in- 
tersect the conic at four known points 
Pa, Ps, P, P3 as in the figure, the chords 
P,P, and PP} intersect at one point on L, 
the chords PaP} and PP, intersect at 
another point on Lı, and, moreover, these 
two intersections determine Lı. Thus 
(unless parallelism causes trouble) we can 
start with just four points on a conic and 
use them to determine a point P, and its polar line Ly}. This “four-point con- 
struction” is remarkable because the four points on the conic do not determine the 
conic. If we are given a fifth point Ẹ on the conic (such that no three of the five 
are collinear) then the conic is determined and we can produce the dotted lines of 
the figure to construct a sixth point Æ’. Appropriate use of polar points and 
polar lines provides methods for construction of more points. We are not 
expected to learn much about these matters in our elementary course, but we can 
be aware of the fact that many persons continue study of geometry to learn more 


Figure 6.4993 


6.5 Translation and rotation of axes To start learning about 
advantages gained by introducing supplementary coordinate systems, we 
look at an example so simple that the sup- 
plementary coordinate system is not 
needed. When we want to learn about 
the graph of the equation 


x? + y? — 10x — 8y + 32 = 0, 


we complete squares and write the equa- 
tion in the form 


(6.51) (x —5)2?+(y—4)2 =9. 


Figure 6.52 It is then easily seen that the graph is 

a circle with center at (5,4) and radius 3, 
and we can sketch the graph in Figure 6.52 without onerous calculations. 
If in the equation (6.51) we set 


(6.511) t=x—5, y=y—4 
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or, equivalently, 
(6.512) x=x +5, y=y +4, 
the equation takes the simpler form 
(6.513) xi? 4. y2 = 9, 


If we think of x’ and y’ as being coordinates (the prime or primed coor- 
dinates), then (6.513) looks like the equation of a circle having its center 
at the origin of the new prime coordinate system of Figure 6.52. It is 
customary to say that we “translate axes” when, as in Figure 6.52, we 
introduce a supplementary coordinate system that can be obtained from 
the original one by translations, that is, by slidings free from rotations. 
It sometimes happens that, in more complicated situations, introduction 
of a supplementary coordinate system helps us to determine the nature 
and position of the graph of a given equation. 

We now begin consideration of the graph of the equation Q = 0, where 


(6.53) Q = Ax? + 2Bxy + Cy? + 2Dx + 2Ey +F, 


it being supposed that 4, B, C, D, E, F are given constants for which 
A,B,C arenotall0. Incase B = 0, the following results can be obtained 
more quickly by completing squares. In any case, we undertake to 
determine constants h and k such that the substitution x = x’ + h, 
y = y’ + k will yield a simpler expression for Q. Substitution gives 


(6.531)  @Q = Ax!? + 2Bx'y! + Cy’? + 2D'x! + 2E'y' + F' 


where D’ = Ah + Bk + D 

E' =Bh+Ck+E 

F! = Ah? + 2Bhk + Ck? + 2Dh + 2ER + F. 
In case B? — AC # 0, we can simplify the expression for Q in (6.531) by 
determining h and k so that D’ = E’ = 0. Except in the special case 
in which F” = O when h and k are so determined that D’ = E’ = 0, the 
equation Q = 0 resulting from making D’ = Æ’ = Ois not easily graphed 
unless Æ =C =0 or B =0. Hence, at least in the study of Q when 
B #0, the results that lie ahead are much more important than those 
obtained by translation of axes. For those who may become interested 
in such things, it may be remarked that the quadratic form Q in 


(6.54) Q = Axx + Bey + Dxz 
+ Byx + Cyy + Eyz 
+ Dex + Ezy + Faz 


reduces to (6.53) when z = 1 and that there are places in pure and 
applied mathematics where these things are important. To attack 
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(6.54), we would use the formulas (2.67) 
(with xo = yo = zo = 0) from Section 2.6 
instead of the simpler formulas of the next 
paragraph. 

It is customary to say that we “rotate axes” 
when, as in Figure 6.551, we introduce a sup- 
plementary coordinate system that can be 
obtained from the original one by rotation 
about a line through the origin perpendicular to the plane of the given 
coordinate system. Without going back to review formulas from Section 
2.6, we shall use Figure 6.551 to derive the formulas 


Figure 6.551 


x = x’ cos 6 —y’ sin 9, x’ = xcosd+ysin 6 


(6.55) y = x’ sin 0 +y’ cos 6, y = —x sin 8 + y cos 0 


that relate the original and prime coordinates of a point P when the prime 
coordinate system is obtained by rotating the original axes through the 
angle 6. The primitive formulas 


/ / 


x y . x 
— = cos = = sin — 
- o z o 5 


= cos (¢ + 4), Z = sin ($ + 86) 


and the formulas for cosines and sines of sums give 


x = r cos (¢+ 0) = r cos ¢? cos 0 — r sin ¢ sin 0 = x’ cos 0 —y’ sin 0 
y = r sin (6+ 6) = r cos ọ sin 0 + r sin ọ cos 0 = x sin 0 +y’ cos 9. 


This gives the first set of formulas (6.55). The second set can be obtained 
by solving the first set for x’ and y’, or by making appropriate observa- 
tions about the result of replacing 0 by —9. 

Before attacking more ponderous expressions, we observe that the first 
formulas in (6.55) show that if Q = xy, then 


(6.552) Q = (x cos 6 — y’ sin 6)(x’ sin 0 + y’ cos @) 
= (x/? — y?) sin 0 cos 6 + x’y'(cos? 0 — sin? 6) 
= 4(x’? — y’?) sin 26 + x’y’ cos 20. 


We can eliminate the x’y’ term by making cos 20 = O and hence by setting 
20 = x/2 and 0 = x/4 (or 45°). This gives Q = 4(x’? — y’*). The 
graph in the xy plane of the equation xy = 1 is the same as the graph in 
the prime coordinate system of the equation 


x’? vy’? 
6.553 — 
(6.333) A Gd? 


Thus the graph of the equation y = 1/x is now thoroughly inducted into 
the hyperbolic fraternity. 
As in (6.53), let 


(6.56) Q = Ax? + 2Bxy + Cy? + 2Dx + 2Ey + F 
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and suppose that 4, B, C are not all zero. Rotating the x, y axes through 


the angle 6 gives new coordinates x’, y’, and using (6.55) shows that, in 
the new coordinates, 


(6.561) Q = Ax? + 2B'x'y + Cy’? + 2D'x’ + 2E'y' + F 
where 4’ = A cos? 6 + 2B sin 6 cos 6 + C sin? 6 
(C — A) sin 6 cos 6 + B(cos? 0 — sin? 6) 
C’ = A sin? 6 — 2B sin 6 cos 6 + C cos? 6 
D’ = D cos 6+ E sin 6 
E! = —Dsin 6+ E cos 6. 
Considerable information can be extracted from the above formulas 


without the aid of noncranial electronics. For each 6, we obtain the first 
of the formulas 


(6.562) A +C S= A+A, B?2 — A'C’ = B? — AC 


by adding the expressions for 4’ and C’. To prove the second formula, 
we can write an expression for the left side, cancel the terms that cancel, 
and obtain the result with the aid of the identity 


% 
l 


cost 0 + 2 cos? 6 sin? 6 + sint 0 = (cos? 0 + sin? 0)? = 1. 
Because of (6.562), the quantities 4 + C and B? — AC are said to be 


invariant under rotation of axes. 

The basic importance of the formulas for 4’, B’, C’, D’, E’ lies in the 
fact that they show us how to determine 0 and the coefficients in (6.561) 
so that B’ is zero and the objectionable term is missing. When we write 
the formula for B’ in the form 


B’ = 4(C — A) sin 20 + B cos 28, 


we can see that it is easy to find a unique angle @ for which 0 S$ 6 < x/2 
and B’ = 0. IncaseC = A, we set 0 = 7/2. IncaseC ¥ JA, we choose 
26 such that 0 < 26 < r and 


2B 
(6.563) tan 26 = 4-C 


Without repeating the formulas for the prime coefficients and the method 
by which 0 is found, we put our main result in the following theorem. 


Theorem 6.57 Each Q of the form 


(6.571) Q = Ax? + 2Bxy + Cy? + 2Dx + 2Ey + F 
can be put in the form 
(6.572) Q = A'x!? + Cly!? + 2D'x + 2E'y +F 


by suitable choice of an angle 0 for which O S 0 S 1/2. 
When we are asked about the graph of Q = 0, we can calculate the 
coefficients in (6.572) and then (completing squares when necessary) use 
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(6.572) to get the information. It is of interest to observe that we can 
get information about Q without making these calculations. Even when 
we do not know the numerical values of 4’ and C’, we know that 


(6.573) B? — AC = —A'C' 


when B’ = 0 because B? — AC is invariant under rotation of axes. In 
case B2? — AC < 0, we must have 4’C’ > 0, and (6.572) shows that the 
equation Q = 0 has elliptic type. In case B? — 4C = 0, we must have 
A'C' = 0 and the equation Q = O has parabolic type. In case B? — 
AC > 0, we must have 4’C’ < 0 and the equation Q = 0 has hyperbolic 
type. Sometimes it is necessary to know the fact, which was given in 
Problem 32 of Section 6.4, that graphs of equations of elliptic type are 
not always ellipses; they may be circles or points or empty sets. The 
number (2B)? — 44C or 4(B? — AC) is called the discriminant of Q, and 
its sign is the same as the sign of B? — 4C. Hence we can put the above 
results in a form that is easier to remember and use. 

Theorem 6.58 The equation Q = 0 (and, in fact, Q itself) is elliptic 
or parabolic or hyperbolic according as its discriminant 15 negative or zero 
or positive. 

Finally, we should know something about ways of using (6.563) to 
calculate the coefficients 4’, B’,C’, D’, E’. Itis possible to write formulas 
giving sin 6 and cos @ in terms of 4, B, C, but these formulas are so unat- 
tractive that we shun them. Applied mathe- 
maticians do not need lessons from this book to 
obtain approximations to answers by finding 28 
from a table or slide rule and then finding @ by 
gaily dividing by 2. When exact results must 
be obtained, we draw a figure more or less 
Figure 6.581 similar to Figure 6.581 and use elementary ideas 

to obtain cos 26. When 0 < 6 < r/2, cos 0 
and sin 0 must be nonnegative and their values can be calculated from 
the formulas 


(6.582)  cos0=. [1+ cos 70, sin § = [i = cos 26 


This gives the information needed for locating the prime axes and cal- 
culating the prime coefficients. 


Problems 6.59 


1 Sketch x, y axes having origin O and, in the same figure, sketch primed 
(or new) parallel axes with origin O’ having unprimed coordinates (2,3). Using 
the primed axes, sketch the parabola having the equation y’ = x’. Find the 
primed coordinates of the points 4 and B where the parabola intersects the lines 
having the primed equations x’ = —2 and x’ = 2. Find the primed coordinates 
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of the focus F and the primed equation of the directrix Dof the parabola. Finally, 
find unprimed equations and coordinates of the parabola and of 4, B, F, and D. 
2 Write the equation 
(+2)? | 0-3} 
a + _ 


(1) 73 1 
in the form 

(2) 4x? + By? + Cx + Dy +E =0. 
Then substitute 

(3) x=x +h y=y +k 


into the result and so determine 4 and k that the coefficients of x’ and y’ will be 
zero. Show that the new equation can then be put in the form 
x2 12 


y 
(4) pt oral. 


Finally, sketch properly related unprimed and primed axes and sketch the graph 
of (4). 

3 Show how a primed coordinate system can be introduced to simplify the 
process of obtaining basic information about the hyperbola having the equation 


(3)? _0-1?_ 


32 E l. 


Sketch a figure showing both sets of axes and the hyperbola. 
4 Translate axes to remove the first-degree terms from the equation 


xy — 2x + 3y —4 = 0. 


Remark: This gibberish means something. Let x =x’ +h, y =y +k and 
determine h and k in such a way that the coefficients of x’ and y’ in the new equa- 
tion will be 0. Ans.: h = —3, k = 2, x'y' = —2. 

§ Translate axes to simplify the equation 


x+t+eyty = 3. 


Ans.: Putting x = x' + h and y = y’ + k, we find that the first-degree terms 
disappear (have zero coefficients) when h = —1, k = 2, and that the simplified 
equation is x’? + x’y’ = 2. 

6 Construct and solve more problems 
similar to the preceding two, but keep the 
equations simple. We do not have time to 
do chores that computers should do. 

7 A rough graph of the equation 


Figure 6.591 


(1) yout 


is easily drawn. A miniature version appears 
in Figure 6.591. Since (1) can be put in the 
form 


(2) x? — xy +1 =0, 
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the graph must be a conic. While it is possible to foresee some of the results of 
putting (2) into standard form by “‘rotation of axes,” we examine the details of 
the process. Show that substituting 


(3) x =x'cos6—y'siné, y =x’ sin 0 + y’ cos 0 


into (2) gives an equation in x’ and y’ for which the coefficient of x’y’ will be zero 
when tan 20 = —1 and hence 


3r 3r l +cos20 v2 -—1 

(4) 20 = 7 0 = y> cos? OD OR” 
ang = bcos 20 _ V2 +1. 
7 2 2/2 


Continue the work to show that the equation in x’ and y’ can be put in the form 
; x!?2 y’? 
©) 2J2+2 272-2 
Note that the result agrees very well with Figure 6.591. The distance from the 


center of the hyperbola to its vertices is V2 +2 1/2. It is possible to use (5) 
to obtain more information in primed coordinates and to express this information 
in terms of the original coordinates, but all of these operations consume consid- 
erable time. 


8 Obtain preliminary information about the graph of the equation 
a(x +y)=1 


by solving for y (or for x) and making a rough sketch. Then obtain more precise 
information by rotation of axes. 
9 Apply the procedure of Problem 8 to the equation 


x* + xy +y = 1. 


10 Make and solve more problems similar to Problems 8 and 9, but keep 
the equations simple. We still do not have time to do chores that computers 


should do. 
11 When a and b are positive, the equation 


ax? + by? — ah? = 0 
has elliptic type. For what values of k does the equation 
a*x? + key + b*%y? — a%b? = 0 


have elliptic type? Parabolic type? Hyperbolictype? Hint: The discriminant 
is $? — 4a?b?, and we are put on the right track when we notice that this is nega- 
tive and our equation is elliptic when & = 0. 

12 When a and b are positive, the equation 


a*x? — b?y? — gh? = 0 
has hyperbolic type. For what values of & does the equation 


ax? + key — b*y? — a2b? = 0 


1. 
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have hyperbolic type? Parabolic type? Elliptic type? Hint: Get on the right 
track. 

13 We should know something about the possibility of obtaining information 
about the graph of the equation 


Ax? + 2Bxy + Cy? + F =0 


by solving for x or y by use of the quadratic formula. Other cases being similar 
or much easier, suppose that C > 0, B #0, and F #0. Show that a point P 
lies on the graph iff (iff means if and only if) its coordinates satisfy the equation 


_ —Bx + VB? — AC)x® — CF 
a a 


Show that if B? — ÆC > 0, then the points (x,y) for which |x| is sufficiently 


great and 
x CF 
y= 3] -24 æ- 40) -F| 


must lie on the graph, and tell why the graph cannot be an ellipse. Show that 
if B? — AC < 0, the graph cannot be a hyperbola. 
14 Sketch a graph of the equation 


y=s+ VI 


by sketching graphs of yı = x and ye = ~/1 — x? and adding ordinates (that is, 
values of y). Give some precise information about the graph. 
15 It is sometimes said that the graph of the equation 


xy + vy = aX, 


where a is a positive constant, is a parabola. Is this true? Ans.: No, but the 
graph is a part of a parabola. 

16 A wheel of radius a rolls, without slipping and with angular speed w, on 
the top side of the x axis of an xy plane. At time zt = 0, the center is above the 
origin, and a pink spot P which rotates with the wheel lies b units below the center 
of the wheel. Show that if an x’, y’ coordinate system, with origin at O’, travels 
with the wheel but keeps its unit vectors i’ and j’ in the directions of i and j, 
then 


— 


O'P = —b sin wti — b cos wij. 
Show that, because the wheel rolls without slipping, 
OO’ = awti + aj. 
Show that the vector r running from O to P is 


r = (awt — b sin wt)i + (a — b cos wt)j. 
Sbow that 
v = (aw — bw cos wt)i + (bw sin wt)j 
a = bw?(sin wti + cos wtj). 
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Remark: Moving coordinate systems and vectors provide the simplest way of 
obtaining neat and correct answers to problems more or less like this one. The 
paths traced by the pink spots P are cyclozds. For future reference, we note that 
if b = a, so that P is on the rolling circle and the cycloid is an ordinary cyclord, 
and if we let 0 be the angle wi through which the wheel has rotated at time i, 
then the coordinates of P are 


x = a(b — sin 0) y = a(l — cos 0). 


Figure 6.592 exhibits the ordinary cycloid having these equations. The unusual 


Figure 6.592 


cycloids are sometimes called curtate cycloids or prolate cycloids, and are sometimes 
called trochoids. 

17 Find equations of epicycloids, that is, paths traced by points on spokes 
(or extended spokes) of circular wheels which roll, without slipping, outside a 
fixed circular wheel. Outline of solution: As in Figure 6.593, let the fixed circle 
have radius a and have its center at the origin of an x, y coordinate system. 
Let the rolling circle have radius b and have center O’ which travels with it, and 
let wt (not restricted to the interval 0 S wt < 27) be the angle which OO’ makes 
with the x axis at time ż so that 


(1) OO = (a + b)(cos wti + sin wf). 


We suppose that w > 0, so the spokes in 
the moving wheel rotate in a positive 
direction. Let the point P which traces 
the epicycloid be the point for which 
O'P = ci when z = 0, and let S be the 
spoke of the moving wheel which (ex- 
tended if necessary) contains P. The 
Figure 6.593 angle y (psi) through which the spoke S 
has turned at time ¢ is the sum of two 
angles: (i) the angle @ through which it would turn if its wheel rolled the distance 
awt along a straight line and (ii) the angle wt through which it would turn if it slid 
without rolling on the fixed circle. We find that awt = b0, so 6 = (a/b)ut, 


(2) pa tte, 
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and 


(3) OP = c (cos $F" ant + sin 2+? oni) 


at time z. From (1) and (3) we obtain 


(4) r =| (a+b) cos wt + ¢ cos E or |i 


+ | (@ +b) sin ot + c sin E oar | j 


for the displacement vector of the point P on the epicycloid at time £ An ordi- 
nary epicycloid is obtained by setting c = —b so P starts at the initial point of 
tangency of the two circles. Remark: For the case in which ec = —b andb =a 
we obtain the epicycloid of one cusp having the equation 


7 


(5) r = a[2 cos wt — cos 2wtli + a[2 sin wt — sin 2w#]j. 

Using the trigonometric identities 

(6) cos 26 = cos? § — sin? 8 = 2 cos? 9 — 1, sin 26 = 2 sin 8 cos 8 
enables us to put (5) in the form 

(7) r — ai = 2a(1 — cos wt)(cos wti + sin wij) 

from which we see that 

(8) lr — ai] = 2a(1 — cos wt). 


Letting p denote the distance from the point (4,0) to the point P on our epicycloid 
and setting @ = wt puts (8) in the form 


(9) p = 2a(1 — cos ¢). 


The polar coordinate graph of (9) is a cardioid. Thus we have discovered that 
an epicycloid with one cusp is a cardioid, and we have started learning about 
epicyclic gears. That the Greek prefixes epi and hypo mean outside (or above) 
and inside (or below) can be remembered by those who have hypodermics put 
under their skins. 

18 Find equations of hypocycloids, that is, paths traced by points on spokes 
(or extended spokes) of circular wheels which roll, without slipping, inside a 
fixed larger circular wheel. Outline of solution: This problem is much like Prob- 
lem 17. To get the answer from (4) of Problem 17, replace b by —& because 
|00'| = a — b. For hypocycloids the spokes of the inner wheels run backwards. 
The equation giving displacement vectors of points on hypocycloids is 


(1) r=| (2-5) cos wt + ¢ cos “=~ wt | i 


+ [la = b) sin wt — c sin 2 or |. 


An ordinary hypocyeloid is obtained by setting ¢ = b soP starts at the initial point 
of tangency of the two circles. A graph appears in Figure 7.291. Remark: For 
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the case in which c = b and b = a/4, we obtain the hypocycloid of four cusps 
having the equation 


(2) r = $ [3 cos ¢ + cos 3¢li + F [3 sin 6 — sin 3¢]j 
or 

(3) r = a[cos® wti + sin? wej] 

or 

(4) x + y% = a”. 


19 Persons interested in machinery should determine the path traced by a 
cog on the inner wheel of a hypocyclic gear when the radius of the inner wheel is 
just half the radius of the outer wheel. 

20 A rod of length 2a, always in the xy plane, is whirling about its center with 
angular speed w and, at the same time, is so thrown that its center has coordinates 
(At, Bt — Ct?) at time t. Supposing that a red spot on the stick started at the 
point (0,2) at time ¢ = 0, find its position and velocity and acceleration at later 
times. Try to solve the problem without use of the following outline. If 
unsuccessful, look hastily at the outline to get some ideas and then try to solve 
the problem with the outline out of sight. Outline of solution: Use vectors and 
a moving coordinate system. Let an x, y coordinate system with unit vectors 
i and j be drawn and remain fixed (that is, always in the same place). Let an 
x’, y’ coordinate system have origin O’ at (41, Bt — Ct?) at time ż, and let its unit 
vectors i’ and j’ have the directions of i and j so that, in the world of the moving 
coordinate system, the stick is doing nothing but rotate about O’. Letting r 
be the vector running from O to P, we have 


r = 0O’ + OP 
= (dt + a cos wt)i + (Bt — Ct + a sin wt)j 
SO 
v = (4 — aw sin wt)i + (B — 2Ct + aw cos wt)j 
and 


a = (—aw? cos wt)i + (—2C — aw? sin wt)j. 


21 Let a and b be positive constants, and let G be the graph in the x’, y’ plane 
of the first of the equations 
1 


yy’ = ae oz", y = at dk ae 
2r o 


Find two positive constants \ and ø such that G will be the graph of the second 
equation in the x, y plane, the primed and unprimed coordinates being related 
by the formulas x’ = Ax and y’ = Ay. Ans.: A = (ma?/b), o = (1/4ra?b)r, 


6.6 Quadric surfaces This brief section, which contains no terminal 
list of problems, can be omitted, but it can be read even by students who 
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are not required to read it. As is easy to see by considering such special 
examples as 


661) @t+ytt+24+1=0, ty t=O x+y? =0, 
the graph in Æ; of an equation of the form 


(6.62) Ax? + By? + Cz? + Dxy + Eys + Fzx + Gx + Hy 
+iz+/f/=0 


can be the empty set or a point or a line. Incase 4, B,C, D,E,F are not 
all zero and the graph is a surface, the surface is called a guadric surface. 
Examples of the form 


(6.621) Ax? + By? + Cxy + D =0, 


where 4, B, C are not all 0, show that some quadric surfaces are cylinders 
which may be planes or pairs of planes. Our interest in this section lies 
in quadric surfaces that are not cylinders. 

As we study quadric surfaces, it will be helpful to have some results of 
Section 2.6 in mind. A line which does not lie completely on a quadric 
surface can intersect the quadric surface in at most two points. Each 
plane section of a quadric surface must be the empty set or a single point 
or a single line or two lines or a circle or a parabola or an ellipse or a 
hyperbola. From this we can conclude that each nonempty bounded 
plane section of a quadric surface must be either a point or a circle or an 
ellipse. 

When the equation (6.62) of a quadric surface is given, it is possible to 
introduce a new coordinate system in such a way that the equation in the 
new coordinates has one or another of several standard forms. There are 
places, even in applied mathematics, where it is necessary to know pro- 
cedures for putting given equations into standard forms, but the pro- 
cedures are much too complicated for coverage in elementary courses. 
In this course we can be content with a little information about the 
standard forms and their graphs. In what follows, it is always assumed 
that a, b, ¢ are given positive constants. ‘The standard forms are 
selected in such a way that, insofar as is possible, the coordinate planes 
are planes of symmetry and the curves in which the surface intersects 
planes parallel to the coordinate planes are easily described and sketched 

The graph of the equation 


(6.63) uy Ey 


is an ellipsoid except when a = b = cand the graph is a sphere. Putting 
z = k shows that the ellipsoid intersects the plane having the equation 
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z = k in a set which is the empty set if |k| > c, a point if |k| = c, and an 
ellipse (or circle if a = b) if |k| < c. Very similar remarks apply to the 
planes having the equations x = kand y = k. It is easiest to sketch the 
intersections (or sections) for which k = 0, but full information about 
other sections parallel to the coordinate planes is easily obtained. For 
example, when |&| < c, we can put z = k in (6.63), transpose the term 
involving c2, and then divide by the new right side to obtain 


2 
+ + = 1. 
=) 


In case a ¥ b, this shows that the graph intersects the plane having the 
equation z = in an ellipse which has its center on the z axis and which 
intersects the planes having the equations x = 0 and y = Q at the points 


2 2 
(o. +b Ji — E é), (+24) — E 0, 4). 
Cc Cc 


In case a = b, the section is a circle. Unshaded and shaded graphs appear 
in Figures 6.631 and 6.632. Incase a = ¢ and b is greater than a and c, 


Figure 6.631 Figure 6.632 


the graph is a prolate spheroid more or less like a cucumber. In case 
a = b and c is less than a and b, the graph is an oblate spheroid more or 
less like the earth (which is depressed at the poles and bulges at the 
equator) or like a pancake. It is possible to use material from Section 2.6 
and Chapter 6 to show that each plane section of an ellipsoid must be an 
empty set or a single point or a circle or an ellipse. 

The graph of the equation 

x? y2 g2 

(6-64) atpo an} 
is a hyperboloid of one sheet. It intersects the plane having the equation 
z = kin an ellipse (or circle if a = b). It intersects the plane having the 
equation y = k in a hyperbola when |k| = b and in a pair of lines when 
|k| = b. It intersects the plane having the equation x = kin a hyperbola 
when |&| + a and in a pair of lines when |4| = a. Unshaded and shaded 
graphs appear in Figures 6.641 and 6.642. 
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Figure 6.641 


The graph of the equation 
2 2 2 2 9 2 
66) S+#-5=-l1 o ~-5-$4+5=1 


is a hyperboloid of two sheets. It intersects the plane having the equation 

Figure 6.651 z = kin the empty set when |&| < c, in a point 
when |&| = c, and in an ellipse (or circle) when 
|k| >c. It intersects the planes having the 
equations y = k and x = k in hyperbolas. 
Unshaded and shaded graphs appear in Fig- 
ures 6.651 and 6.652. 


Figure 6.652 
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The graph of the equation 
16.66) 


2 2 
5 + a = % 

is an elliptic paraboloid. It intersects the plane having the equation 
z = k in the empty set when k < Q, in a point when k = Q, and in an 
ellipse (or circle) when & > 0. It intersects the planes having the 
equations x = k and y = k in parabolas. Unshaded and shaded graphs 
appear in Figures 6.661 and 6.662. 
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Figure 6.661 Figure 6.662 


ai 


The graph of the equation 
(6.67) 


i) 


eae 
PE 


SIS 


is a hyperbolic paraboloid. It intersects the plane having the equation 
z = kin a hyperbola when k < Q, in a pair of lines when k = O, and ina 
hyperbola when k > 0. It intersects the planes having the equations 
x = k and y = k in parabolas. Unshaded and shaded graphs appear in 
Figures 6.671 and 6.672. Hyperbolic paraboloids are the simplest 


Figure 6.671 Figure 6.672 : 
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examples of surfaces known as saddle surfaces. The origin is a saddle 
point, and a huge surprise awaits embryonic mathematical physicists who 
suppose that only cowboys are interested in them. 

Finally, we should not forget the cones. When 4 and B are nonzero 
constants not both negative, the graph of the equation 


(6.68) Ax? + By? = z? 


is a nondegenerate guadric cone It is a cone because the point 
(Ax,Ay,Az) lies on the graph whenever à is a number and the point (x,y,z) 
lies on the graph. It intersects the plane having the equation z = kina 
point (the vertex of the cone) if k = 0 and in a central conic (circle. 
ellipse, or hyperbola) having its center on the z axis if k #0. Instead of 
exhibiting graphs and photographs of quadric cones, we conclude with a 
remark. Since we know that hyperbolas have asymptotes, we need not 
be surprised to learn that hyperboloids can have asymptotic cones. The 
cone having the equation 


(6.69) 


is the common asymptotic cone of the two hyperboloids having the equa- 
tions (6.64) and (6.65). 

While some of us will meet ellipsoids and other quadric surfaces after 
completing this course, we need not invest our time in consideration of 
more or less routine problems involving special quadric surfaces. We 
have earned the right to think about a little (or big?) problem that can be 
of interest to those who slice onions and other things. Let S be a set in 
E; which contains more than one point. Suppose that, for each plane r, 
the intersection of S and v is a circle or a set consisting of just one point or 
the empty set. Do our hypotheses imply that S must be a sphere? 
If itis easy to make an incorrect guess and if the problem is difficult, so be 
it. If there is just one easy way to solve the problem and only one person 
in a million can discover the way, so be it. An anecdote illustrates the 
fact that we are sometimes provided opportunities to do our own thinking 
and investigating. In a lecture to advanced students at Cambridge, 
G. H. Hardy stated that the answer to a particular question is obvious. 
Becoming dubious about his assertion, Hardy asked his students to excuse 
him while he sat down to think about the matter. Five minutes later, 
Hardy arose to report that the answer is obvious and to continue his 
lecture. 


Curves, 


T lengths, 


and curvatures 


7.1 Curves and lengths One of the main purposes of this chapter 
may seem at first sight to be quite modest. We want to show that the 
length L of the circular arc of Figure 7.11, in which 4BCD 


B C is a rectangle and the arc is tangent to CD at D, satisfies 
the inequality 
(7.12) [4B| < L < [BC| + [CD]. 
Before we ask our little sister to solve the problem, we 
A D should ask ourselves a question that shows us that the 


Figure 7.11 problem is not completely simple. What is a circular arc, 
and how do we know that it determines a number that can 

be called its length? It is clear that we need some definitions before we 

can do anything. 
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For an accurate treatment of matters relating to lengths of curves, we 
need more information about things that are sometimes called curves 
and are sometimes called oriented curves. A bumblebee can give us 
preliminary ideas by starting at 4, flying to a rose at B, flying on to 
another rose at C, flying back over the same route to B, and then flying 
to D and on to £ asin Figure 7.13. His 


total path is a curve, and we get the idea 4 c E 
that a curve is not determined by a point \& = j 
set. We cannot know what the curve is B A 

-y 


until we know the order in which points on 
the curve were visited. For example, if Figure 7.13 
the bumblebee flies from E back to 4 by 
traversing his route in reverse, his path is a new curve which can be called 
the negative of the original one. While the matter has psychological 
rather than logical importance to us now, we can feel quite confident that 
if the bumblebee makes a flight over the interval t; S t S te, and if we 
introduce an x, y, z coordinate system, then at each time in the interval 
he is surely someplace and that if we denote his coordinates by x(t), 
y(t), z(t), then these coordinate functions are continuous functions of ¢. 
Of course, the curve does not uniquely determine the coordinate functions 
because the bumblebee can fly his course at different speeds, but any one 
set of appropriate coordinate functions does determine the intrinsic 
curve. The above brief discussion of curves leaves many unanswered 
questions. It will have served its purpose if it provides a hazy feeling 
that the following definition uses words in a reasonable way. 

Definition 7.14 If x,y, z are continuous functions of t over an interval, 
then the ordered set of points 


(7.141) P(t) = P(x(t), yC), 2(¢)) 


for which t lies in the interval, and for which P(t’) is said to precede P(t”) if 
t! < t, ts a curve (or oriented curve) C. 

Professional creators of complicated curves can give an example of a 
curve in E; that is clever enough to “pass through” each point in a given 
cube or even in the whole £3. Such curves are space-filling curves. It 
must not be presumed that all curves are complicated things, however. 
For example, when z(t) = 0 for each #, the curve lies in the xy plane and 
we omit the z(#) when no confusion can result. We always have the 
possibility of setting x(t) = t and replacing t by x. This shows that if f 
is continuous over a S x < b, then the set of points (x, f(x)) on the graph 
of y = f(x) becomes a curve when we so order the points that (x1, f(*1)) 
precedes (x2, f(x2)) when a S xı < x2 S b. Unless an explicit statement 
to the contrary is made, it is presumed that the points are ordered in this 
way whenever the graph of y = f(x) is called a curve. Finally, such 
things as circles, ellipses, rectangles, and triangles become curves as soon 
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as we think of them as being traversed once in the positive (counter- 
clockwise) or negative (clockwise) direction. 

Supposing that C is a given curve for which the interval appearing in 
Definition 7.14 is a closed interval a £ t S b, we proceed to define (when 
it exists) a number |C| which is called the length of C. Let P be a parti- 
tion of the interval by partition points to, t1, * * * , ta for which 


(7.15) a= to Lt Lti <et < tiri Ltn =b. 


For each & = 0, 1, 2, © + + , n, let Py be the point on C for which ż = t. 
Figure 7.151 is a schematic figure that may be 
helpful, but it is much too simple to show how 
much care is needed to guarantee that the points 
are not scrambled in an inappropriate way. The 
polygon (or broken line) running from Po to P; 
to Pa and so on to P, is said to be inscribed in C. 
The number Sp defined by 


n 


Figure 7.151 Sp = 2 |Pr-P:l 
=] 


is the sum of the lengths of the sides of the inscribed polygon. Let S 
be the set of numbers Sp obtained by making partitions P of the interval 
a<t<zb. Incase there is no number M such that Sp S M whenever 
Sp is in S, it is said that C does not have length or that C does not have 
finite length or that C has infinite length or that |C| = ©. In case there 
is a number M such that Sp S M whenever Sp is in S, then Theorem 5.46 
guarantees existence of a least number |C| such that Sp < |C| whenever 
Sp isin S. This number |C| is then, by definition, the lengtht of C. 

For future reference we state without proof the following theorem which 
involves notation we have been using. 


Theorem 7.16 The curve C has finite length |C| tf and only if 


n 


C| = lim $, PeaPul. 


|P—>0 k=l 


We now use the definition of length to prove the following theorem 
which gives, as a corollary, the desired result involving Figure 7.11. 

Theorem 7.17 Let f be continuous and monotone increasing (or mono- 
tone decreasing) overa S x Sb. LetC be the graph of y = f(x) so oriented 


t In old books particularly, a curve C having finite length is sometimes said to be recti- 
fiable; the ancient idea is that if C is a string we could pull on the ends and straighten it out 
to get a straight string of finite length. We can all know enough about logic to know that 


if C is not a string, then little credence is placed upon consequences of the false assumption 
that C is a string. 
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that (xı, f(x1)) precedes (xe, f(x2)) when xı < xa Then C has finite length 
.C| and 


(7.171) IF) — fla)| S |C] < |b — al + |f0) — FO). 


To prove this, let P be a partition of the interval a S x < b, and, for 
each &, let Pr = (xr, f(xx)). Let Ax, = x, — xp, and let 


Ayn = f(x) — f(r). 


Then 
(7.172) \PriPr| = V Ax? + Ay?. 
But 


(7.173) Ay; S Ax + Ayk S |Ax,z|? + 2|Axz Aya) + [Ayz|? 
= (|Ax| + [Aya !)*. 
Therefore, 


(7.174) [Ayx| S |Px—1Prl SS Ax + Ayal. 


But the numbers Ax; are all positive and the numbers Ay, are all negative 
(or all positive), and hence (why?) addition gives 


(7.178) O — fl@)| SX PeP = |b ~ al + G) — FO), 


Since this inequality holds for each partition P, the curve C has finite 
length |C|. Moreover, the least upper bound of the central sums cannot 
exceed the particular upper bound in (7.175), and no upper bound can 
be less than |f(b) — f(a)|. This proves Theorem 7.17. 

We have been studying lengths of curves, that is, B 
lengths of ordered sets of points of special types. There a 
are other (and different) theories involving lengths of S 
unordered sets S in £3 and E and £;. We need not be Figure 7.18 
authorities on these matters, but we should have at least 
a vague idea that, in elementary mathematics, the part of the set S of 
Figure 7.18 that lies “between” 4 and B has length L if there is a “natural 
ordering” of the points of the part that yields a curve C having length L. 


Problems 7.19 


1 Tell how the length |C| of a curve C is defined. 

2 Show that if a curve C having finite length runs from 4; through 42 to 
As, then £z separates the curve into two parts each having finite length. It is 
not quite so easy to show that “‘the whole is equal to the sum of its parts,” but 
it can be done. In particular, the length of a simple polygon is the sum of the 
lengths of its straight segments. 
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3 Show that the projection on the xy plane of a curve in £3 is a curve in Ey, 

4 Discover continuous functions x and y such that as ¢ increases over the 
interval 0 Sz S 4, the point (x(¢), y(t)) goes once in the positive direction 
around the square having vertices at the points (0,0), (0,1), (1,1), and (0,1). 
Hint: Write the equations which tell where a pencil point will be at time ¢ if it 
traverses the square with unit speed. One ans.: x(t) =t and y(t) = 0 when 
0S2¢81;x() = landy(t) =¢ —1whenl SiS 2;x(¢) = 3 —tand y(t) =1 
when 2 < ż < 3; x(t) =O and y(t) = 4 —i when 3 S t S$ 4. 

5 Let Lı be the length of the central path from 4 to B in Figure 7.191 and 
let Ly be the length of the upper path. Find La — Lı. Ans.: (m — 2)h. 


Figure 7.191 Figure 7.192 


6 Supposing that the inner and outer circles of Figure 7.192 have radii e 
and R, find the length of the curve which begins and ends at J. 

7 Draw a square and from each corner draw a line segment toward the 
center but reaching only halfway to the center. Then insert arrows to specify 
a curve which forms the boundary of the inner region. Find the length of the 
curve. 

8 This problem requires us to think about and calculate the lengths Lı and 
L: of two of the paths by which an insect might crawl from the bottom B to the 
top T of the three-dimensional ring or anchor ring or 
torus of Figure 7.193. The first path, of length Ly, 
is a semicircle of radius b + a which lies on the outer 
circumference of the torus. The second path, of 
length Le, consists of a semicircle of radius a, and 
then a semicircle of radius b — a which lies on the 
inner circumference of the torus, and finally another 
semicircle of a radius a. Try to guess which of the 
paths is shorter and then calculate Lı and Le. Ans.: 


Lı = (a+ 8), Lz = ma + r(b — a) + ra, 


Figure 7.193 so the two paths have equal lengths. Remark: Ifa 
curve G lies in a set S and joins two points P, and 
Pe of S, and if the length of G is less than or equal to the length of each other 
curve C in S which joins P, and Pz then G is called a geodesic in S. As is easily 
imagined, the study of geodesics on a torus is an honorable part of an interesting 
subject. 
9 Creators of interesting tales for children say that, before the time when 
Columbus sailed across the ocean wet, everybody thought that the earth was 
flat. It has in fact been widely known for more than two thousand years that 
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the earth is much like a spherical ball, but is not a spherical ball because moun- 
tains and valleys are quite noticeable around the Mediterranean sea and some 
other places. To show that a little information about lengths of arcs can have 
quite astonishing consequences, we look at the method said to have been used 
by the industrious Eratosthenes (c. 275 B.c. to c. 195 B.c.) to find what the radius 
(a number) of the earth would be if the earth were sandpapered to the shape of a 
spherical ball. Figure 7.194 shows O, the center of the earth, 
and a circular arc 4B on the surface of the polished earth. 
The dotted vertical lines represent rays of light coming from 
the sun, and these rays are so nearly parallel that approxi- 
mations can be based on a figure in which the rays are parallel 
to the line O4. Thus an observer at 4 finds that the sun is 
at his zenith. At Ba pole BC of height h is erected in such 
a way that an observer at B thinks it is vertical, that is, the 
points O, B, C lie on the same line. In addition to h, two 
other lengths are obtained. In the first place, we find the 
length a of the circular arc DC which has its center at B (the 
base of the pole BC) and which has a shadow that just covers Figure 7.194 
the pole. In the second place, the length b of the arc AB is 

obtained by more or less reliable surveyors. Let r be the radius of the earth. 
Since parallelism of the light rays implies that the angles DBC and AOB are equal, 
say to 6, we obtain the equation 


il 


=a 
NIS 


from the fact that each of the ratios is equal to 0. From this equation, r is easily 
calculated. It is not to be presumed that Eratosthenes spoke English and knew 
about miles, meters, and radians, but it is said that his calculations produced 
estimates of the radius and circumference of the earth that are almost as good as 
the estimates (radius 4000 miles, circumference 25,000 miles) that are ordinarily 
used for rough calculations. The fact that reliability of computed results depends 
upon accurate surveying is precisely the reason why two men named Mason 
and Dixon were commissioned to do some accurate surveying. 

10 This problem, like some others, does not have a number for an answer. 
It requires us to think about connections between graphs and curves, and to 
learn some geometric terminology. The graph of a polynomial of degree n, 
that is, the graph of an equation of the form 


(1) y = ao + ax + ax? + > + + ant’, 


in which a, Æ 0, used to be (and occasionally still is) called a parabola of degree n. 
For example, a line is a parabola of degree 1 and a cubic is a parabola of degree 3. 
While this terminology is almost extinct, the graph of the equation 


(2) y? = x? 


still is called a semicubical parabola, the old idea being that, when we solve for y, 
we get an exponent which is not an integer but is half of 3. According to this 
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terminology, the semicubical parabola is a graph (not 
necessarily a curve) which is the point set shown in Figure 
7.195. The graph has two branches, one being the upper 
branch consisting of points for which y = x%, and the 
other being the lower branch consisting of points for 
which y = —x%. Each of these branches contains the 
origin. 

When the points on the upper branch are ordered in 
such a way that P(x,y1) precedes P(x2,y2) when x1 < x, 
the branch becomes a curve C, in the first quadrant 
having an end (or end point) at the origin. Similarly, 
when the points on the lower branch are ordered in such 
Figure 7.195 a way that P(x1,1) precedes P(x2,y2) when x1 < xe. this 

branch becomes another curve C2 having an end at the 
origin. With these orderings, the graph consists of the points on the two curves C, 
and C», the orderings being indicated by the arrows of Figure 7.195 which lie between 
the curves and the positive x axis. When x > 0, the curve Cı has a tangent at the 
point (x,«%) which has slope $x, and the curve C% has a tangent at the point 
(x, —x%) which has slope —$x%. We can observe that C, has a forward tangent 
at the origin whose slope is 0 and that Cz has a forward tangent at the origin whose 
slope is 0. Although we have definitions involving tangents to graphs of equa- 
tions of the form y = f(x) and have definitions involving tangents to curves, we 
have no definitions which we can apply to decide whether the x axis is tangent 
to the semicubical parabola at the origin. In accordance with terminology 
other applications of which are not explained here, we say that the semicubical 
parabola has a cusp at the origin. 

11 Prove that the curve C consisting of points P for which 


(1) OP = ti + 23] 


coincides with the semicubical parabola having the equation y? = x8. Sketch 
a new graph of the semicubical parabola and insert arrows which show how the 
points on the semicubical parabola are ordered to produce the single curve C. 
Put (1) in the equivalent form 


(2) r= fit 1 


and, supposing that a particle moves along C in such a way that its displacement 
vector at time ż is r, find its velocity and acceleration at time ż See what hap- 


pens when you try to write the equation of the line tangent to C at the point 
occupied by the particle at time z = 0. 


12 The folium of Descartes is the graph of the parametric equations 


3 at 3 at? 
(1) x= I+? y = IFE 


The vector r running from the origin to the point P(x,y) on the folium is 


t e t? 
(2) r= 3a| ait posi) 
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there being one such point for each ż for which 1x —1. One who has consid- 
erable time to study the functions defined by (1) and to make some calculations 
can discover the manner in which P wanders as ż increases. As ¢ increases over 
the interval — © < ż < —1, P runs over the curve C, extending from the origin 
(but not including the origin) down into the fourth quadrant. As ż increases 
over the interval —1 < ż < œ, P traverses 
the curve Cz which comes from the third 
quadrant to the origin and then runs around 
the loop toward the origin but does not 
again contain the origin. This problem 
requires that we obtain some more informa- 
tion without working so hard. Show that 
if (1) holds, then y = tx and 


(3) x3 + y3 = Saxy, 


Show that if (3) holds and y = zx, then (1) 
holds. Thus (3) is an equation of the 
folium. If P(x,y) is a point on the folium, 
then (3) shows that P(y,x) is also on the 
folium. Therefore, the line having the 
equation y =x Is an axis of symmetry 
of the folium. This suggests that we introduce the X, Y axes of Figure 7.196 
which bear unit vectors I and J. Use the formulas 


Figure 7.196 


1 1 
1 = —— I — i = — I + 


with (2) to obtain the new equation 


_ 3a t (1 — 2) 
o) ~aliseeelt tee T] 


ofthe folium. Treating (5) as an equation of the form r = XI + YJ, show that 


Sot o A ja 1e 
Jzi-i+t® A Wad—-t+? 


and use the result to find the minimum and maximum values of X and obtain 
more information about the folium. 

13 Let f be continuous over the interval 0 S x S 1 and let C be the set of 
points (x, f(x)) so ordered that (xı, f(x1)) precedes (x2, f(x2)) if xı < xo. We 
may feel that C must have finite length, and we investigate the matter. Theorem 
7.17 shows that C must have finite length if fis piecewise monotone, but we cannot 
be so sure when C is not piecewise monotone. When # and q are positive integers, 
the function sin? x% [or the function g for which g(x) = sin? x3] is not piecewise 
monotone over the infinite interval x > 1 and the function sin? (1/x?%) is not 
piecewise monotone over the interval0 < x <1. Functions f for which f(0) = 0 
and 


f(x) = x" sin? = 


416 Curves, lengths, and curvatures 


are, when r > 0, continuous but not piecewise monotone over 0OS%*81. Sup. 
posing that p=g = r = 2, show that 0 £ f(x) S 1 and f is differentiable as 
well as continuous over 0 <x < 1. It can be proved that C does not have 


finite length. 
14 It is worthwhile to cultivate the 
Rz ANN 
fimmt AN 


ability to understand and even to con- 
struct simple examples of curves that 
have properties of various sorts. Figure 
7.197 shows a part of the graph C of a 


$ function f, defined over 0 Sx < 1, for 
Figure 7.197 which f'(x) exists when 0 < x S 1 and, 
moreover, f(x) = f'(x) = 0 when x = 0, 

1,4, 4%, °° °.. We do not bother to give formulas expressing f(x) in terms of 


x, being content to describe the graph C. The rectangles have their upper left 
corners on the graph of the equation y = x?. The part Cı of the graph in the 
rectangle R, over the interval $ S x S 1 consists of four monotone parts. Show 
that 1 < |C] S$ 2. We now turn our attention to the rectangle Re which lies 
above the interval + < x < 4 and has height (¢)%. In this rectangle we construct 
another part Cz of C, this part consisting of 4? monotone parts so that 1 <S |C,| < 
2. Tell how to continue the construction so that the total resulting curve C 
will not have finite length. 

15 Let x(t) and y(t) be continuous over a S?t <£ b so that the set of points 
P(t) for which P(t) = (x(t), y(#)) and a St S b, so o ordered that P(t,) precedes 
P(t.) when t; < ta, constitutes a curve C in the xy plane. The curve C is said to 
be a closed curve if P(a) = P(b), that is, if the curve ends where it starts. Give 
some examples of closed curves and some examples of curves that are not closed. 

16 A set A in the xy plane is said to be connected if to each pair Pı and P» 
of points in 4 there corresponds a curve C of the type described in Problem 15 
such that P(a) = P;, P(b) = Pe, and each point of C is a point of 4. Let T 
(capital gamma) be the circle with center at Po and radius r. Prove that the 
set 4, of points P for which IPP! < r (this set being the interior of I’) is a con- 
nected set. Prove that the set 4, of points P for which |P,P| > r (this being the 
exterior of T) i is a connected set. Prove finally that hat if a set 43 contains a point 
P, for which |PP,| < r and a point Ps for which |PP,| > r but contains no point 
of the circle T, then 43is not a connected set. Proof of last part: Suppose, intend- 
ing to establish a contradiction, that the set 43 is connected. Then there is a 
curve C, determined by continuous functions x(t) and y(t) as in Problem 15, such 
that P(a) = Pı, P(’) = Pa, and each point of C is a point of 43. Let 


f() = [PPO = VG — x)? + Go — yO)? 
so that f(z) is the distance from Po to PU Then f(a) < rand f(b) > r. Since 


f is a continuous function, it follows from the intermediate-value theorem 
(Theorem 5.48) that there is a number ¢* for which a < :* < b and f(t*) =r. 
The point P(¢*) is a point of the circle T, so P(t*) is not in 43 and we have the 
required contradiction. Remark: The geometrical nature of a circle in a plane 
is so simple that basic facts involving its interior and exterior are easily described 


and easily proved. The remaining problems of this list involve more general 
situations. 
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17 Sketch some figures to obtain preliminary ideas about matters relating to 
the following definition. Let S be a given nonempty set in the xy plane and let 
P’ be a point of S. Let S’ be the set of points P” in S that can be connected to 
P’ by curves lying in S. This means that P” is a point of S’ if and only if there 
exist functions x(t) and y(#) continuous over an interval a S ¢ < b such that 
P(x(a), y(a)) is P’, P(x(b), y(b)) is P”, and P(x(£), y(t)) is a point in S whenever 
axtxb. The set S’ is a connected set because if P, and P; lie in S’, then 
P, and P: can be connected to P’ by curves C, and Cz lying in S’ and these two 
curves can be combined to give a single curve C connecting P; to P» and lying in 
S'. This set S’ is the maximal connected subset of S that contains P’ or, briefly, the 
component of S that contains P”. 

18 The curve C of Problem 15 is said to have a multiple point at the point Q 
if there exist two numbers ¢; and tz such thata St} < t < b(ora < ty} <h Sb) 
and the two points P(t;) and P(t) coincide with Q. The curve is said to be simple 
(free from multiple points) if it has no multiple points. Give some examples of 
closed curves that have multiple points and some examples of simple closed 
curves. 

19 The French mathematician Camille Jordan (1838-1922) was the first 
person to give serious attention to the difficult question whether each simple 
closed plane curve “separates the plane” into exactly two nonempty components 
one of which constitutes the outside of the curve and the other of which con- 
stitutes the inside. Such curves are called Jordan curves. The following 
theorem is known as the Jordan curve theorem. 

Theorem If Cis astmple closed plane curve (or Jordan curve), then the set S of 
points of the plane which are not points of C contains exactly two (no more and no 
fewer) nonempty components Sı and So. 

The author now owes his readers an explanation. It never has been and need 
not be expected that students of elementary calculus know anything about the 
Jordan curve theorem. Nevertheless, the author insists that each student should 
have opportunities to pick up ideas about mathematics. We can know that the 
Jordan curve theorem is so difficult that Jordan never succeeded in proving it 
and that we must learn more mathematics before we can undertake to under- 
stand proofs that have been given or to construct new proofs. We can know 
that many persons who never generated much interest in additions of fractions 
become intensely interested in problems involving sets and curves. 


7.2 Lengths and integrals Let x, y, and z be given functions having 
continuous derivatives over the closed intervala <t < b. Let 


(7.21) P(t) = P(x), y@), 2) 


so that, for each ż, P(t) is the point having coordinates displayed in (7.21). 
Let r(t) denote the vector running from the origin to P(t) so that, for 
each t, 


(7.22) r(t) = x(t)i + yj + 2(2)k. 


We propose to set up an integral for the length |C| of the curve C traversed 
by P(t), or by the tip of the vector r(#), as ¢ increases from a to b. It 
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could be supposed that our main interest is in lengths of curves and that 
we should get our result as quickly as possible, but this is not true. 
Integrals are very important things, and we proceed slowly to learn as 
much as we can about them. 

Our first step is to make a partition P of the interval a S t S b having 
partition points fo, #1, ° * °, tn and to sketch schematic figures showing 
the partition and points Py, = P(t-1) and P, = P(t) on C. We then 
tell ourselves that if |P|, the norm of the partition, is small then |P, P, 
should be a good approximation to the length of the part of C between 
P,_; and P, so ='P,_1P;| should be a good approximation to |C| and hence 
it should be true that 


(7.23) IC] = lim 2|P; Pl. 


We have employed the fundamental ideas about length given in Section 
7.1. The next step is to put the right member into a more useful form. 
While introduction of the abbreviations may be unwise when acres of 
paper and blackboard space are available, we simplify our formulas by 
setting 


Ath = tk — tk-1, Ax, = x(t.) — x(te-1), Ayk = y(t) — y(te-1), 


Azk = 2(ty) — 2(te-1). 
Then (7.23) gives 


(7.231) IC] = lim 2 +/Ax? + Ay? + Az?. 


We can make this look much more like a limit of Riemann sums by 
introducing factors At, in numerators and denominators to obtain 


(7.232) iC| = lim ) (32) + (32) + (4) Ate. 


The possibility of making further progress is provided by the hypothesis 
that x, y, and z have continuous derivatives. 

We should be realistic and realize that there are different ways to 
proceed. We can cheerfully adopt the view that, however we choose 
t* in the &th subinterval of our partition, the th term in (7.232) should 
be closely approximated by f(t) Atx, where 


(7.233) JO = Vie’ @P + yH + e). 
Thus we can expect that 
(7.234) iC] = lim Xf (t) Atp. 


Since our hypothesis guarantees that f is continuous and hence integrable, 
the Riemann sums do have a limit and we are led to the formula 


(7.24) c = f VORE DOP + OF d. 
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This is, in fact, a correct and very satisfying formula which treats all 
coordinates alike and enables us to calculate lengths of curves that wind 
around through £3. Incase C lies in the xy plane, everything is the same 
except that z(ż) = 2/(t) = 0 and the formula is thereby simplified. In 
applications, it often happens that x(t) = ¢ and the formula is written 
in one or another of the forms 


(7.241) [Cl = j ’ JTF DOr + WOR d 
- Í ? VIF DOET OË de 


b 

_ dy\? dz\? 

ETORO 
and, for plane curves, we have z = 0. It is not easy to be sure that the 
“cheerful derivation” of (7.24) is slovenly. A person can claim that he 
will not use the definition of length given in Section 7.1, that heis interested 
only in curves C for which x'(t), y’(¢), and z'(t) are continuous, and that 
all of the work that he has done merely motivates his definition whereby 
IC| is defined by (7.24). His position is tenable and, so long as he stays 
in his own garden, it is even reasonable. In fact, many quantities in 
pure and applied mathematics are defined by integrals, and sometimes 
the definitions are so well motivated that readers (and even writers) fail 
to recognize the fundamental fact that it is utterly impossible to prore 
correctness of a formula for something that has not been defined. 

We start again with the formula 
(2) Ate, 
At, 


(7.25) IC] = lim ` (e) + (a2) 4 
At; Atk 


which is (7.232) brought up where we can see it, but this time our situa- 
tion is different. We suppose that |C| is defined by (7.25), and we want 
to prove that the right side is an integral. For present purposes, the work 
of the preceding paragraph would be slovenly because nothing was proved. 
We could try to fix everything by trying to prove that there is a tý such 
that, with the notation of the preceding paragraph, the kth term of 
(7.232) is exactly equal to f(t*) At. We do not try this, however, because 
of the fear that the process would involve only uninformative hard work. 
Instead, we start by applying the mean-value theorem (Theorem 5.52) 
to (7.25). Since x’ is continuous, there must be a tý for which m1 < 
tx < t, and 


Ax; _ x(tr) — x(tr—1) — (4% 
(7.251) a Dna Me): 


Similar formulas involving y and z end with y'(tž*) and z’(tf**), there 


420 Curves, lengths, and curvatures 


being no reason for hope that the three numbers te, te, and t*** are 
the same. Thus 


(7.252) |C] = lim & vV [x Ge)? + y GE)? + [r GE**)) Ats. 


This looks much like a Riemann sum formed for the function f having 
values 


(7.253) f® = vir Or + OP + ke’ OP, 


and there is no difficulty in the important special case in which x(t) = 7 
and z(t) = 0 for each ¢. Except in special cases, we must consider con- 
sequences of the fact that tř, tř*, and tf** may be different. Since 
difficulties of this nature (and they were real difficulties in the old days) 
are usually associated with the name of Duhamel (1797-1872), we can 
call this a Duhamel difficulty. To treat all coordinates alike and to be 
precise about this matter, we let Tą be the center of the interval t,_, < 
t S t We can then put (7.252) in the form 


n 


(7.254) Cl = lim & (f(T) + è A, 


-0 k= 
where 


(7.255) ôr = Vix’ Ge)? + by’ ER + e Ge)? 
— Vix (TAP + Ly (Te)? + (TF. 


To prove the desired result 
(726) 10 = [IO d = [? VOF VOF KOF d, 


it is therefore sufficient (and also necessary) to prove that 


(7.261) lim J ôr At, = 0. 


|P >0 k=1 


It is easy to originate the correct idea that the numbers t¥, tž*, and zf** 
are all near Ty, that continuity of the functions x’, y’, z’ implies that the 
two terms on the right side of (7.255) are nearly equal, that the numbers 
6, are all near 0, and hence that the sum in (7.261) is near 0, whenever the 
norm |P| of P is small. The easy way to make this precise is to use the 
fact that the first term in the right member of (7.255) is a continuous 
function of three variables. Using only functions of one variable, we 
can let 0 < € < e and choose a number 6 > 0 such that the numbers 
a, B, and y defined by 


(7.262) [x’(u)]?? — [x (0) =a, [y (u) — [y (0)? = £, 
[z’(u)]? — [z (0)]? = ¥ 
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all have absolute values less than e’ when |x — | < 6. When |P| < ô, 
the right member of (7.255) then has the form 


(7.263) òr = VAr th- Vy 


where the quantities under the radicals are nonnegative and 0 < h < 3¢’. 
It follows that |6.| S +~/3e’ and hence 


(7.264) » ôr An | S p VBE Atr = VBE (b — a) <e 


provided e’ is so chosen that the last inequality holds. This proves 
(7.261) and hence (7.26). 

In order to introduce coordinates on curves, we suppose that x, y, and 
z are functions having continuous derivatives over an open interval 
a <t < band that C is the curve traversed 
by the point P(t) having coordinates (x(t), 
y(t), z(t)) as ż increases. To avoid difficul- 
ties, we suppose that the curve is simple; 
this means that P(t) + P(te) when t ¥ te 
so that P(t) cannot be in the same place at 
two different times. Let to be fixed such 
that a < to <b. Then, as in Figure 7.27, 
we can assign coordinates to points on C by 
letting s be the length of the curve running Figure 7.27 
from P(to) to P(t) when tp St <b and 
letting —s be the length of the curve running from P(t) to P(t) when 
a<t<to Then, whenevera < t < b, the coordinate s(t) of P at time tis 


(7.28) s(t) = [VF OP + DOP + KOF ar. 


In this formula 7 (tau, the Greek #) is used as a dummy variable of integra- 
tion. Since our hypothesis implies that the integrand is continuous, 
Theorem 4.35 enables us to differentiate with respect to ¢ to obtain the 
formula 


(7.281) st) = Vix'@))? + by’ Or + eo, 


which is often written in the form 


dx? dv\2 daN ? 
mw eE 


This is related to the formulas we get when we set 


(7.283) r = x(di + y@j + 2k 
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and calculate the velocity v and the speed |v| from the formulas 


(7.284) v=x (Ði + y Hj + zk 
and 
(7.285) lvl = Vir + by’ OR + [z 60. 


Since the right members of (7.282) and (7.285) are the same, this shows 
that, in appropriate circumstances, the speed determined so quickly by 
taking the absolute value of the velocity is the same as the “speed of the 
particle in its path” determined by use of coordinates on the path. 


Problems 7.29 
1 Find the length of the part of the helix 
x = acost, y = asin t, z = kt 


traversed by a particle at P(x,y,z) as ż increases from 0 to 2r. 
2 Using the first standard equation in 


+j =l, x=acosð, y= bsin ð, 


where 0 < b < a, set up an integral for the length of the part of the ellipse lying 


in the first quadrant. Then use the parametric equations to set up another 
integral for the same length 


2 __ pind x/2 
Ans.: N E EA as, a K «/1 — k? cos? 6 dé, 


where k? = (a? — b?)/a?. These are elliptic integrals, and others more or less 
like them are also called elliptic integrals. There is no easy way to obtain their 
exact numerical values. 


3 Set up an integral for the length L of the curve traced by the point P 
having coordinates 


x = aĝ — b sin 86, y =a — bcos @ 


as 0 increases from 0 to 27r, and show that the result can be put in the form 


2r 4ab 6 
L= (a+b) I q! - g Fy 8" 5 20. 


When b ¥ a, this is an elliptic integral. When b = a, the graph is, as Problem 

16 of Section 6.5 shows, an ordinary cycloid. Sketch a figure and explain a 

simple geometric argument which shows that, when b = a and the approximations 

x = 3 and z? = 10 are used, \/52a < L < 10a. Finally, show that L = 8a. 
4 Find the length of the part of the curve having the equations 


t? — 3 _2-? 
277 *= "9 


which lies between the points (0,—$,1) and (2,—1,3). Ans: L = v2 [19% — 1). 


x = 22%, y = 
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5 When 0 <b < a and a circle (or cog wheel) of radius b rolls without 
slipping inside a circle of radius a as in Figure 7.291, the path traced by a point 
on the small rolling circle is a hypocycloid (inside-cycloid) that appeared in one 
of Problems 6.59 and that even small dic- 
tionaries describe. It can be shown that 
when the center of the small circle has 
passed over an angle @ (that could be wz), 
the point initially at (2,0) has coordinates 


a—b 
b 


a—b 


y = (a — b) sin 6 — b sin 7 6. 


x = (a—b)cos6+ b cos 


6 


Figure 7.291 


With the aid of Figure 7.291 find the 
amount by which @ must increase as P traverses one arch of the cycloid from the 
big circle back to the big circle. Then find the length of one arch of the cycloid. 
Ans.: 8b(a — b)/a. 

6 Itis a remarkable fact that when b = a/4, so that the hypocycloid of the 
preceding problem has four cusps, the equations can be put in the form 


x = 


(3 cos 6 + cos 36) = a cos? 6 


mI BIA 


y = 7 (3 sin 8 — sin 36) = a sin? 6 
so that 


x34 + ys = gs, 


Using this formula, find the length of the part of the hypocycloid in the first 
quadrant. Ans.: 3a/2. 

7 Let fand g have continuous derivatives over a S u S b. Let C be the 
curve which the point P(f(u), g(u)) traverses in an xy plane as u increases from 
atob. Let C be regarded as a wire having linear density 6 so that a piece of C 
of length L has mass 6L. Let p be a nonnegative integer and let x9 be a constant. 
Set up an integral for the pth moment of the wire about the line having the 
equation x = xo. 

8 With or without assistance from the preceding problem, set up an integral 
for the pth moment about a diameter of a circular wire having linear density ô. 

9 Centroids of triangles, rectangles, and regular polygons coincide with the 
centroids of the regions that they bound. 
Draw some polygons for which there is violent 
departure from coincidence. 

10 Figure 7.292 is intended to steer our 


Figure 7.292 


thoughts toward a wire or cord concentrated o (%3, 
on a curve C and to make us realize that we l 
have not yet calculated the gravitational ! 
force F upon a particle P* of mass m at i 
P(x,y,z) that is produced by the wire. Set 1 


up an integral for F. Outline of solution: 
Since x, y, z have been preempted, we take 
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the curve C to be the ordered set of points P(t) = P(u,o,w) for which u, v, w 
are functions of ¢ having continuous derivatives and 


(1) OP = u(i + o()j + w(e)k (a StS), 


and suppose that the linear density (mass per unit length) of C at P(e) is 8(9, 
where ô is a continuous function of t. Letting Q be a partition of the interval 
a S t £ b with partition points ¢, and supposing that %1 S tt Z tp, we take the 
number 8(t;) |P,:P;| as an approximation to the mass of an element of the 
wire. The force AF, which this element produces upon the particle P* of mass 
m at P(x,y,z) should have approximate magnitude 


Gmô(t)| PrP. 
2 |PP(tz) I? 


Introducing vectors and coordinates gives the approximation 


* Auz \? Ao; \? Awg \? 

(3) AF, = Gmô(tz) (=) + (=) + (=) 
[u(tz) — xli + [o(e) — yli + [wGe) ~ z)lk At 

(ulk) — x]? + lok) — yl]? + lwl) — a2} 


The limit of the sum of these things should be F, and we are led to the definition 


(4) F=Gm f a ôt) viu (N) + E + o)? 
u4) — ali + PG — yi + oo - alk 
{[u(z) — x]? + bG) — y]? + [w@) — z} 
This is our result. In case the wire coincides with a circle of radius a in the yz 


plane having its center at the origin, we can set u(t) = 0, o(¢) = a cos t, w(t) = 
asin ¢ and put (4) in the form 


(5) F = Gma f. °F s ed + [a cost — ylj + [a sin ż — glk 


(a? F [a cos + — yl? F [a sine — a} ” 


In case P* is on the axis of the wire so that y = z = 0, this takes the much simpler 
form 


aooo La A asin ck 
(6) F = Gma f° ao A a 


In case the wire is a uniform wire so that, for some constant 59 we have 6(t) = ĝo 
for each ż, the coefficients of j and k are zero because 


(7) J, costa = [sin edt = 0 
0 0 
(or, as is usually said, “because of symmetry”) and (6) reduces to 
GmMx 
8) F = — G+ Ah 


where M = 2raôo the total mass of the wire. Remark: With slight differences 
in notation, (8) was used extensively in some of the Problems 4.79. 
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11 Let units of measurement be so employed that the potential at Po(xo,4'0,20) 
due to a particle of mass m concentrated at a point P(x,y,z) is m/ |PoPI?. Let 
x, y, z have continuous derivatives over a S u S b, and let P(x,y,z) traverse a 
curve C as u increases from a to b. Let C have linear density ô. Let Po(xo,yo,20) 
be a point not on C. Remembering that potentials are scalars, and remembering 
or learning quickly that the potential due to a collection of particles is equal to the 
sum of the potentials due to the individual particles, set up an integral which is 
equal to the potential at Po due to C. 

12 Set up an integral equal to the moment of inertia (second moment) 
about the z axis of the curve C of the preceding problem. 

13 When a particle has fallen (or slid without friction) from height G(d) to 
a lower height G(u), its speed is ~/2g[G(b) — G(u)], where g is the acceleration of 
gravity. We all know that if we travel a short distance with a constant speed, 
we get the time required for the trip by dividing the distance by the speed. 
With this basic information, we are prepared to attack a problem. Let F and 
G be functions having positive continuous derivatives over 0 S$ u < b and let 
the point P(F(u), G(u)) traverse a curve C from the origin to a point B in the 
first quadrant of an xy plane as u increases from 0 to b. Sketch a schematic 
figure showing the curve C. Set up an integral which gives the time T required 
for a particle starting from rest at B to slide without friction down the curve C 
to the origin. 

14 The reflecting surface of a headlight is a part of a paraboloid, of depth 
4 inches and diameter 12 inches, obtained by rotating a part of a parabola about 
its axis. Find its area. Solution: This is not an easy problem, since a basic 
difficulty lurks in the fact that areas of such surfaces have not been defined. 
We start by so determining & that the parabola having the equation y = kx? 
contains (or passes through) the point 
(6,4). This gives y = gx? for the equa- y 
tion of the generating parabola. With 
the possibility of setting a = 0, b = 6, 
and f(x) = 9x2, we suppose that f has 
a continuous derivative over an interval 
asxxzb. Let S be the surface gen- 
erated by rotating, about the y axis, the 
part of the graph G of y = f(x) for which 
asxsb. Expecting tousesome intui- 
tive ideas about areas, we make a parti- Figure 7.293 
tion P of the interval a < x <S b and 
consider one particular subinterval x,-1 S x S xk- As an approximation to 
the length L of the segment G, of the graph G containing points x for which 
Xp~-1 L x S xk, we may use the number 


(1) V Axi + Ay? = 4/1 + (=) Ax, = V1 + Lf (xz)]? Axr 


k 


x} x 


where Ax, = xk — xn-1, Aye = f(x) — f(%e-1), and x, is a number for which 
x-1 < xp < xp Figure 7.293 is helpful. Even though areas are not yet defined. 
we can have a feeling that when the segment Gx is rotated about the y axis, its 
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points all travel approximately the same distance 24x, and that the segment 
generates a surface essentially like a ribbon having width L, and length 2r% 
This leads us to the intuitive idea that one term in the sum 


(2) |S] = lim 22a V1 + U(x)? x: Axy 


should, when the norm |P| of P is small, be a good approximation to the area of the 
part of S generated by one segment of the graphG. The next step is to adopt the 
tentative intuitive conclusion that the sum is a good approximation to the total 
area [S| of S when the norm of P is small or, in other words, that (2) should be 
valid. Our theory of Riemann sums and integrals assures us that if (2) is true, 
then 


(3) \S| = 2r Í > VIF IOP « dx. 


If we have sufficient confidence in our calculations (it would, of course, be fatal 
to use an incorrect formula for the circumference of a circle of radius x+) and in 
our intuitive ideas, we can install the formula (3) as one (not the) definition of 
area of surfaces of revolution. It does not make sense to claim that this definition 
is “correct,” but experience shows that it is useful and this is all that we can 
expect from definitions. For the case in which f(x) = kx?, a = 0, and b = 7, 
(3) reduces to 


IS] = 2r f (1 + 4k?) Yor da = aa [C + 4era — 1). 


When k = and r = 6, this reduces to [S| = 49r. 

15 We invest a little time to look at some curve integrals that are called line 
integrals by those who adhere to the notion 
that curves are lines. Let functions x(t), y(t), 
z(t), the point P(t), the vector r(t), the curve 
F C, the partition P of a £ t S b, and the num- 
p Ar, bers Azz, Axz, Ayk, Azk be defined as in the part 
of this section preceding (7.231). Leta vector 
y function F having scalar components fı, fo, fs 
be defined and continuous over a part of E; 
< that contains the curve C. We consider F to 
Figure 7.294 be a force which operates upon a particle at 
P(t) as the particle moves from P (a) to P(b). 
The schematic Figure 7.294 may be helpful. With the idea that the work done 

by F as the particle moves from P(tz-1ı) to P(t) is closely approximated by 


zZ 


(1) F (hs Yk,Zk)*ATk 


when the norm of the partition P is small, we can define the total work W done 
by F by the formula 


(2) W = lim ZF (xi, yx,2%)* Are. 
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The right member of (2) is an example of a curve integral, and it is denoted by 
the symbol in 


(3) W = | ¥(e)-dr, 
where F(r) is the vector function of the vector r defined by 
(4) F(r) = F(x,y,2) 


in which x, y, z are the numbers for which r = xi + yj + zk. The right member 
of (3) is read “‘the integral over C of F(r) dot dr,” the fundamental idea being 
that it is the curve C that is partitioned to produce the approximating sums. 
To learn something about this curve integral, we can use the formulas 


(5) F (x,y,z) = fil%,y,2)i + folx,y,2)j + fs(x,y,2%)k 
(6) Ar, = Axi + Ayj + Azk 


to put (2) in the form 
(7) W= lim X | fule(4), y(t), 2(te)) a + folx(te), y(tx), 2(t)) om 
+ falet), ya), 10) Ze | At 
A proof very similar to the one centering around (7.255) enables us to show that 
(8) W = J. fi(x(2), y@), z(t))x (E) dt + J: Fallt), y(t), 2(t))y’(t) dt 
+ S AEO, IO, G) at 


This is a formula from which W can be calculated or approximated. Since we 
are not electronic computers, we do not (at least at the present time) make cal- 
culations, but we do point out that the integrals in (8) are abbreviated to those 
in the formula 


(9) W = |o Liy) dx + falesys2) dy + falesy,2) da. 


The integrals in (9) are scalar curve integrals. Whenever we want to know 
what these things mean and how they can be evaluated, we should have the wits 
to check back to see what they abbreviate. While it may be possible to over- 
emphasize the importance of the matter, we can observe that if we set 


(10) F(t) = fa(x,y,2)i + fo(x,y,2)j + fo(x,y,2)k 

and make the pretense that dr is a vector for which 

(11) dr = dxi + dyj + dzk 

and 

(12) F(r)-dr = f,(x,y,2) dx + folx,y,z) dy + f(x,y,z) dz, 


then we can substitute (12) into (3) to obtain (9). Of course, it should be thor- 
oughly understood that we have done nothing but explain symbols. This would 
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be a stupid waste of time if it were not for the fact the symbols are useful. They 
appear even in quite elementary physics and engineering, and mathematicians 
have a responsibility to tell what they mean. 


7.3 Center and radius of curvature Courses and textbooks ip 
“differential geometry” provide information about matters relating to 
curvature of curves that lie in £3. In this section, we give most of our 
attention to curves that lie in an xy plane. Our approach to the subject 
sacrifices brevity to place emphasis upon elementary geometric ideas 
that can be of interest to everyone and are needed by engineers and others 
who study the bending of beams. 

Let x and y, or x(t) and y(t), have continuous second derivatives over 
some open interval a < t < b in which ¢ and ¢t + At are always supposed 
to lie. Let P(t) denote the point with coordinates (x(t), y(¢)) and let 
r(t) be the vector running from the origin to P(t) so that 


(7.311) rt) = x«@)i+ yQ@)j 
(7.312) vit) = x (i+ yj 
(7.313) a(t) = x" (i + y” (Oj. 


Let C be the curve traversed by P(t) and the tip of r(t) as t increases, so 
that v(t) is tangent to C at P(t) when v(t) = 0. Henceforth we consider 
only values of ¢ for which 


i j K 
(7.32) v@) xaQ@) =| LA yA 0] = r ay e — x ey ek = 0. 
w(t) yA 0 


Since |v x a| = |v| ja] |sin 6], this means that v and a are nonzero vectors 
which do not lie on parallel lines. This and the first of the formulas 


v(t + At) — vit) £0 


. v(t + At) — v(t) 
(7.321) lim — ar IN 


Ai =a(t), v(t)x 


implies existence of a ô > O such that the second holds when 0 < |At| < ô. 
Since v(t) X V(t) = 0, we conclude that v(t) x v(t + At) Æ O and hence 
that the tangents v(t) and v(t + At) do 

Figure 7.33 . notlieon parallel lines when0 < |Az| < 6. 
<c Supposing that 0 < |Az| < ô, we 
construct Figure 7.33 and look at it. 

Since the tangents v(t) and v(t + At?) 

P(t+At)= to C at P(t) and P(t + At) do not lie 

(En) P(x(t+ At), y(¢+44) on parallel lines, the normals to C at 
these points must intersect at a point 

(fn) having coordinates ¢ (xi) and 

PC) = POLO IO) n (eta). If C is a circle, then the 
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most elementary of elementary geometry tells us that the points (&,7) 
obtained for different values of Az would all lie at the center (X,Y) 
of the circle. This is no time to be devoid of ideas. We can have at 
least a vague feeling that, even when our curve C is not a circle, a small 
section of C in a neighborhood of P(t) should be so much like a circle 
that there should be a point (X,Y) such that (£n) is near (X,Y) whenever 
Atis small. If all this happens, we should give the point (X,Y) a name 
and find formulas for X and Y. All this does happen. The point is 
called the center of curvature of C at P(t). We shall find formu’as for 
X and Y and for the distance from (X,Y) to P(t). This distance is called 
the radius of curvature p (rho) of C at P(t). The circle with center at 
(X,Y) and radius p is called the circle of curvature of C at P(t). 
The coordinates ¢ and 7 are determined by the system of equations 


[£ — x(t)]x’(2) + in — y(t)]y’(2) =0 
[E — x(t + Atx (t + At) + [n — y(t + Aty’ (t + At) = O. 


The left member of the first equation is the scalar product of the vector 
v(t) tangent to C at P(t) and the vector running from P(t) to (&), and 
the equation expresses the fact that the two vectors are orthogonal. 
Similarly, the second equation expresses the fact that (&) lies on the 
normal to C at P(t + At). Replacing the quantities in brackets in the 
second equation by 


[i — x(t) — x(t + At) +x()] and n y) — ye + At) + ye) 


and transposing a part of the result enables us to put the two equations 
in the form 


(7.341) [E — x(¢)]x'(2) + In — yy) = 0 
(7.342) [i — xl + At) + In — y@)ly’@ + At) = Q, 


where we have simplified matters by letting Q denote the quantity 
(7.343) Q = [x(t + At) — x(2)]x’(t + At) + [y(t + At) — yaly E + At). 


To eliminate 7 from the two equations (7.341) and (7.342), we multiply 
the first by y'(t + Az) and the second by —y’(#) and add to obtain the 
first of the formulas 


(7.35) Dig — x] = —Qy’@®), Din — y] = Ox"), 
where D is the determinant 

D = x'(t)y'(t + At) — x’ + Addy’) 
which can be put in the form 


(7.351) D = x'()[y'(t + A) — ¥'@] E + Ad) — Oy O. 
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A similar procedure, in which the multipliers —x’(¢ + At) and x(t) are 
used, gives the second of the formulas (7.35). Formulas for ¢ and 3 
are now obtained from (7.35) by dividing and transposing. As we shall 
see, formulas for X and Y are obtained by dividing (7.35) by At and taking 
limits as At—> 0. Since (7.343) and (7.351) show that 


lim 2 = WOP + D'O 

lim = OO -OV O, 
we find from (7.35) that 

Di ___&OP+Y'OR_, 
(7.361) X — x(t) = lim [— — x(t)] = Sy A — y A y(t) 


o — @P+h’OP , 
(7.362) Y — y(t) = lim [n — y@)] = "OUO ELOTON (t). 


Transposing the terms x(t) and y(t) gives formulas 


-aa PORDO 
(7.363) X= x(t) OTHO) — OTHO) y (t) 
(7.364) Y= y() + OP + OP re 


x(t) y(t) — x” Ey (2) 


for the coordinates (X,Y) of the center of curvature of C at the point 
(x(t), y(t)), but these new formulas are sometimes less useful than their 
parents. The definition of radius of curvature p implies that 


p= VIX — x0) + [Y - yO? 
and hence 
— (rO + yA 
737) e= @y"® = OVO 


In case x(t) = t so x’(t) = 1 and x’’(z) = 0, it is customary to replace t 
by x and write 


2713 
voy hye T 


To steer our thoughts toward another problem involving curvature, 
we consider the rate of change of direction of an automobile which trav- 
erses a level road that winds around swamps and between mountains. 
At each time ¢ the rate depends upon the speed of the automobile and 
upon another number which is called the curvature of the road at the 
position of the automobile. To be more precise about this matter, let 
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y and y be functions of t having two derivatives each over some interval 
and let C be the curve traversed by the point P(t), having coordinates 
x(t), y(t), as t increases over the interval. We suppose that there is no 


+ for which x’(t) and y’(t) are both 0. Our results will be obtained with 
the aid of the three familiar vectors 


(7.381) rit) = xit yj 
(7.382) vit) = LA t yj 
(7.383) a(t) = x” (Hi + y” Oj. 


The vector V(t) having its tail at P(t) is the forward tangent to C at P(t). 
With each t we wish to associate an angle ¢(t) that determines the direc- 
tion of v(t), and this is a matter that must not be treated carelessly. 
To restrict (t) to an interval like —r < @ < r, so that ¢ would be dis- 
continuous when the vector rotates from northwest to southwest, would 
defeat our purpose. To formulate general principles by which ¢(t) can 
be calculated may be beyond our capabilities. Let us then avoid possi- 
ble unforeseen topological difficulties by restricting attention to curves C 
for which it is clearly possible to determine ¢(#) by the following proce- 
dure. Let to be a particular ż in the interval considered and let (to) be 
the angle for which —r < ¢(to) Sm and ¢(to) is the ordinary trigono- 
metric angle having its initial side on the positive x axis and its terminal 
side on the vector through the origin parallel to v(toọ). Then, as ż increases 
(or decreases) from to, let #(t) vary with the vector v(t) in such a way that 
¢ is continuous. Some curves are less complicated than that shown in 
Figure 7.384, and some are more complicated. Supposing that ¢(t) is 


Figure 7.384 


satisfactorily determined, we can put the formula (7.382) for v() in the 
form 


(7.385) Iv(2)|[cos Hi + sin 4j] = v, 


where 


(7.3851) x'(t) = |v(z)|cos e), y ( = |V@Isin 4) 


and 


(7.3852) wl = {e + y) = 0. 
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Since we want a formula for ¢’(t), it is reasonable to differentiate (7.385) 
with respect to ¢ and to try to use the result. We divide by |v(t)| and 
then differentiate to obtain 
d 
lv@)la@) — ve) = vO 
(7.386) [— sin ¢(t)i + cos jle (t) = — Rko 


The coefficient of ¢'(t) is the unit vector n for which 


(7.3861) n = [— sin ¢(t)i + cos (Hj) = ee x (t)j_ 


Since n-v(t) = 0, we can equate the scalar products of n and the mem- 


bers of (7.386) to obtain 


$'(t) = volane _ [x"@i+ yO- y Hi + OI] 
o OP [v(e)|? 
and hence 
x (y E) — x" ey E), 
[x’(¢)]? + [y 6)? 
For some applications of (7.387), we can set x(t) = t so that x’(t) = 1 
and x” (t) = 0. In such cases, we can replace t by x to obtain 


(7.387) ¢’(t) = 


, na Ve) 
(7.3871) p'(x) EOE 
In this, and in any other case in which x’(t) ¥ 0, we can eliminate prac- 
tically all of the work of this section by writing 


_y® 
(7.3872) tan ọ = PIO 
with or without the aid of (7.3851) and then differentiating with respect 
to ż to obtain (7.387). 

Returning to (7.387), we note that ¢'’(t), which might be measured in 
radians per minute, gives the time rate of change of ¢ with respect to t. 
In case the point P(t) traverses C with unit speed, which might bel 
kilometer per minute, ¢’(t) becomes also a number of radians per unit 
distance measured along C, and this is called the curvature K or K(t) of 
C at P(t). To give curvature an additional leg upon which to stand, we 
introduce upon C a coordinate system like that in Figure 7.27 with the 
stipulation that the coordinate s of P(t) increases as ¢ increases. The 
curvature of C at P(t) can then be defined by the formula 


(7.388) K = K(i) = &. 
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Since d¢/ds = ¢'(t)/s’(t) = ¢'(t)/|n(z)|, the formulas (7.387) and (7.3852) 
yield the formula 


_ x'(y"() — xyi 
(7.389) £ = TOP Sy ORT 


from which the curvature of C at P(t) can be calculated without reference 
to other formulas. Perhaps we should take notice of the fact that a curve 
C is an ordered set of points, and that we can misunderstand (7.389) 
when we forget this fact. In particular, the sign in (7.389) will be wrong 
if we put the coordinate system on C backwards so that s decreases as ż 
increases. 

We conclude with a fundamental observation. As formulas (7.37) 
and (7.389) show, the radius of curvature p and the absolute value |K\| of 
the curvature K are reciprocals wherever we have defined both of them. 


Problems 7.39 
1 By use of the equations 
x = acost, y =asint, 


show that the curvature of the curve C consisting of a circle of radius a traced in 
the positive direction is identically 1/a. Then by use of the equations 


x =acosi, y = —asini, 


show that the curvature of the curve T (capital gamma) consisting of a circle 
of radius a traced in the negative direction is identically —1/a. 

2 Hindsight can be very good. Look at (7.388). Then run with constant 
speed and in the positive direction around a circle of radius a and observe that 
@ increases at a constant rate. Then reverse the direction of the run and observe 
that @ decreases at the same constant rate. 

3 Determine the radius of curvature of (that is, at points of) the parabola 
having the equation y = kx®. Find the minimum radius of curvature. Sketch 
a graph for the case in which k = 1 and determine whether the answer seems to 
be correct. 

4 Show that the normals to the graph of y = x? at the points (0,0) and (0.01, 
0.0001) intersect at the point (0, 0.5001). Show that this intersection is at dis- 
tance 0.0001 from the center of curvature of the graph at the point (0,0). 

5 A glance at the graph of y = log x suggests that the absolute value of the 
curvature is greatest and that the radius of curvature is least when x is somewhere 
between + and 2. Find the x for which the radius of curvature attains its mini- 
mum value. Ans.: \/2/2 = 0.707. 

6 When a point P(x,y) moves along an arc or curve C having equations 
x = x(t), y = y(t) that satisfy appropriate conditions, the center (X,Y) of curva- 
ture moves along an arc that is called the evolute of C. The formulas (7.363) and 
(7.364), which we should be able to use but need not remember, show how X 
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and Y depend upont. Supposing P moves along the graph of y = kx? in such a 
way that x =zand y = kt’, show that the equations of the evolute are 


l 2 
X = —4kt, Y = 7 + 3kt?. 


7 This problem involves a little story. A string is wound in a clockwise 
direction around a circular spool of radius a with an end at the point (a,0). 
When the string is unwound, being kept stretched during the pracess, the end 
of the string traces the spiral curve C shown in Figure 7.391. When að units 


Figure 7.391 


of string have been unwound, this part of the string is tangent to the spool at 
the point Q(X,Y) for which 


X = a cos 6, Y = asin 0 
and the end of the string is at the point P(x,y) for which 


x = x(0) = a cos 6+ að sin 0 
y = 9(0) = asin 6 — að cos 8. 


It can be observed that Q moves around the circle just as rapidly as the distance 
from Q to P increases. It is not unreasonable to guess (or at least to consider the 
possibility) that the center and radius of curvature of C at P are Q and aĝ. On 
the other hand, a skeptic can be uncertain whether QP is perpendicular to the 
tangent to C at P. The situation demands clarification. Start with the equa- 
tions of C and find the center and radius of curvature of C at P. Ans.: (X,Y) 
and aĝ. Remark: The curve C is called the involute of the spool. The spool 
ts the evolute of its involute. 
8 Determine the radius of curvature of the “standard ellipse” 


x? y? . 
atp”! or x = a cos l, y = b sinl 


by use of the first “standard equation” and then by use of the latter parametric 
equations. 


9 Using the parametric equations of Problem 8, find the evolute of the 
standard ellipse. 
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10 Asan alternative to (or in addition to) Problem 8, find the radius of curva- 
ture of the “standard hyperbola.” 
11 A glance at the hypocycloid having the equations 


x = a cos? t, y = asin?z 


indicates that the radius of curvature p should be a maximum at points (x,y) 
for which |x| = |y| and a minimum at points for which xy = 0. Can we believe 
our eyes? Ans.: No, because p has no minimum. When? Æ nr/2, calculations 


give p = 72 Isin 2z|. Thus p is a maximum when 2: = (x + 3)” or i= 


G + i)” and hence when |sin z| = |cos z| or |x| = [yļl. We see that p — 0 as 


t— 0, but the curvature at the cusps is undefined. 

12 When a flexible cord or chain (the Latin word for chain is catena) is sus- 
pended from its ends in a parallel force field, it hangs in a curve (or point set) 
called a catenary. Differential equations textbooks show that a rectangular 
coordinate system can be chosen in such a way that the equation of the catenary 
1s 

a 


= 5 (ezla -+ ezla), 


Find the radius of curvature of this catenary at the point (x,y). Ans.: y?/a. 
13 Find the radius of curvature of the cycloid having the equation 


x = a(b — cos 9), y = a(l — cos 6). 


14 Find the evolute of the cycloid of Problem 13. 

15 Persons who picnic beside lakes several miles long can wonder whether 
poor visibility instead of curvature of the earth is responsible for invisibility 
of distant boats and shores. This problem involving curvature can be solved 
very simply. Figure 7.392 shows the center of a spherical earth at C on the 


Q(x,0) 


Figure 7.392 


positive y axis, the x axis being tangent to the surface of the earth at O. The 
line from C to Q(x,0) intersects the surface of the earth at P. The number IPO] 
is the height 4 of an object which is visible from the point O on the earth x miles 
away. The simple calculation 


x2 
1 = «/R2 2. R n — 
(1) h R? +a? — R VRF x +R 
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shows that the approximate formula 


x2 
(2) h = 7R 
gives good results when x is small in comparison to R. It is a quite remarkable 
fact that if h and R are measured in miles and if the height H of the object is 
measured in feet so that H = 5280}, then we can put R = 3960 in (2) and multi- 
ply by 5280 to obtain 

H = §x?. 


Putting H = 6 shows that only the hair on the top of the head of a man 6 feet 
tall is visible from a point on the earth 3 miles away. Putting x = 30 gives 
H = 600 and shows that less than the top half of the Empire State Building 
can be seen by persons on a ship 30 miles away. 

16 It is sometimes useful to have formulas obtainable from (7.361) and 
(7.362). Letting N [where the N can make us think of a normal to C at P(t)] be 
the vector running from P(t) to the center (X,Y) of curvature, we see that 


[x’(e)}? + [y’@)]? Hee 1 atten 
(1) N = OORE UOO) [~y (ti + x’ (Oj). 


Let b (where the b can make us think of binormal or “second normal”) be the 
unit vector in the direction of v(t) X a(t) so that, as (7.32) shows, b is the coordi- 
nate vector kin our work. Then, with the aid of (7.32) and the fact that b-b = 
1, we can put (1) in the intrinsic form 


MORLO 


(2) N = A x a@rbo 


b(t) X v(t) 
in which coordinates do not appear. The formula (2) is valid when C is a curve 
in Es for which x,y,z are functions having continuous second derivatives and 


rt) = «(i+ yj + 2@)k 
v(t) = #/@i+ y'(t)j + 2’Ok 
a(t) = x” (i + y” Ej + z” )k 


whenever ż is such that v(t) X a(t) = 0. We shall not prove this, but remark 
that the point (&,7,¢) analogous to the point (£7) of Figure 7.33 is the intersection 
of three planes and that (X,Y,Z) is the limit as At > 0 of (&,7,¢). 

17 Jt is possible to obtain very informative formulas by considering the 
motion of a particle which moves along a plane curve C, endowed with coordinates 
as in Figure 7.27, in such a way that s increases as ż increases. We assume 
existence and continuity of all the derivatives we want to use, and we assume that 
dx/dt > 0. Let t be the unit forward tangent vector to C at time ż so that 


(1) t = cos gi + sin @j, 


where ¢ is, as in the discussion of Figure 7.384, an angle giving the direction of t 
at time ¢. Show that differentiating (1) gives 


dt 
2 z = [— sin $i + cos gil $ -nB 


n3 g 7 g E 
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where n is the unit normal obtained by rotating t clockwise through the angle 


m/2, ds/dt is the speed of the particle, and K is the curvature dø/ds of C. Show 
that writing the formula v = |y|t in the form 


ds 

(3) v= ai t 
and differentiating give 

dv d's ds dt 
(4) a= ~mttaa 
and that use of (2) then gives 

d*s ds\? 
(5) a=Set+(S Kn. 


Remark: This elegant formula gives the normal (or transverse) and tangential 
scalar components of the acceleration in terms of the speed and rate of change of 
the speed of the particle. The simplest applications involve the case in which the 
particle moves along the curve with constant speed o so that ds/dt =o and 
d?s/dt? = 0 at each time. In this case, (5) reduces to the simple but important 
formula 


(6) a = o?Kn. 


If K = 0, then a =O. If K #0, then a has magnitude o?|K| or o?/p (where p 
is the radius of curvature) and is directed toward the center of curvature. Use ot 
(6) and the formula F = ma gives the force required to propel a particle of mass m 
along a curve C with constant speed ø. 

18 Let C be a simple closed convex curve which could, for example, be a 
famous “triangular roller’ composed of the three vertices of an equilateral tri- 
angle together with three circular arcs of which each has its center at one vertex 
and contains the other two vertices. Let R be a rod (or line segment) whose 
length L is small enough to permit the rod to be “‘slid around” C in such a way 
that its two ends both remain on C. It is easy to presume, as has sometimes 
been done in “‘proofs” of the “Holditch theorem,” that each point P on R must 
traverse a simple closed convex curve Cp as R slides around C. Persons having 
a compass, a straightedge like the edge of a sheet of paper upon which marks 
can be placed, and some spare time at their disposal can sketch interesting 
figures. Surprises await those who let C be a triangular roller, let the length ¿ 
of the rod R be equal to or only a bit less than the distance between two vertices 
of the equilateral triangle, and let P be the mid-point of L. 
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8.1 Trigonometric functions and their derivatives We are study- 
ing and perhaps even learning mathematics, and it may be amusing and 
perhaps even useful to see how an unscientific but highly critical Justice 
of a Supreme Court might be introduced to angles and trigonometry. 
We would teach him enough about numbers, £2, continuous functions, 
and curves to make him realize that if we start with positive numbers a 
and h for which 0 < h < a, then the ordered set of points (x, ~/a? — x?) 
for which k S x <S a, the point for which x = xe preceding the point 
for which x = xı when a S xı < x2 Sh, is a curve. Our curve lies on 
the circle with center at the origin and radius a, and we simplify (or 
complicate) matters by calling our curve “the arc, of the circle with 
center at the origin and radius a, which runs in the positive direction 
from the point (a,0) to the point Q having coordinates (h, ~/a? — h?)”’ 
or, for temporary purposes, simply “the arc.” To help us remember the 
meanings of our symbols, we can start constructing Figure 8.11 and 


amplify it as we proceed. Next we must teach our pupil a theory of 
438 
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length and prove to him that the arc [IF 
has a length which we may denote by s. OG va K) 
Knowing that our quantities depend 
upon A and a, we can define a number 6 
by the formula 


_s _ length of arc 7 A(a,0) x 
(8.12) 0 = a radius Figure 8.11 


If we multiply each of a and h by the same positive constant i (lambda), 
we get a new radius and a new arc, but the ratio s/a remains the same. 
To prove this, we must prove that the length of the new arc is the product 
of and the length of the original arc. Our theory of length enables us 
to do this because, by drawing radial lines from the origin, we can see 
that each polygon inscribed in the new arc determines (and is determined) 
by a polygon inscribed in the original arc and that the lengths of the 
straight segments of the polygons all differ by the same factor \. All 
this shows that we get the same 6 when we take another pair of values of 
aand h for which the point (h, „a? — h?) lies on the same half-line extend- 
ing from O through Q. Now we can again simplify (or complicate) 
matters by considering the number @ to be “a measure of the amount of 
rotation required to bring a line from the position OA to the position 
OQ” or “a measure of the opening between the lines O4 and OQ.” Per- 
haps to remind us where the number 6 came from, or perhaps to indicate 
something of which 6 is to be considered a measure, we complete Figure 
8.11 by inserting the 0 together with the curved arrow which shows the 
direction of our arc. It is the fashion to call 6 an angle, but it should be 
permanently remembered that @, like 5, is a number. The facts that we 
sometimes use 0 to measure an amount of rotation and use 5 to measure 
a number of fingers do not imply that @ is a rotation and that 5 is a fist- 
ful of fingers, but we can nevertheless understand and even use the more 
or less convenient terminologies involving “angles” that have become a 
part of nonscientific as well as scientific attempts to convey information. 
If all this indicates that trigonometry is a subject much too difficult for 
inclusion in trigonometry textbooks, we do have one consolation. The 
hard work is done and the rest is easy. 

Our theory of curves is sufficiently general to allow us to extend the 
above account of positive (that is, counterclockwise) arcs and angles to 
cover situations in which the arc is longer and Q lies in the second or 
third or fourth quadrant. Moreover, the arc can be so long that we 
must encircle the origin more than once to traverse it, and the number 
(or angle) 0 is still defined by the same formula (8.12). In case the 
arc starts at 4 and is oriented in the negative (or clockwise) direction, 
everything is the same except that the directions of the arrows are 
reversed, a negative sign is prefixed to the middle and last members of 
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(8.12), and the number @ is negative. Even though we could pretend to 
be appalled by the idea that numbers have sides, we bow to conventions 
and agree that the way to find the six fundamental 
trigonometric functions of a given angle @ is to con- 
struct “‘the terminal side of 6” as in Figure 8.13, pick 
a point P(x,y) on this terminal side, let r be the dis- 
tance (positive, of course) from O to P, and use the 
numbers x, y, rin the usual way. 

It is our purpose to present formulas for deriva- 
tives of trigonometric functions with derivations to 
which no logical objections can be raised. We could undertake 
to use the formula 4 = $6r? for the area of a circular sector of 
radius r which has central angle 6, but it would be immediately 
recognized that this formula has not been proved. Even if it be conceded 
that we know that the area of a whole circular disk of radius r is mr?, it 
is only the docile acceptance of a crafty fraud that would allow us to 
“see at once” that the sector “obviously” has area 467? because the area 
of the sector divided by the area of the circle is “obviously” 6/27. Noth- 
ing is obvious. Even a hazy understanding of the theory of area is 
enough to show that nothing can be proved without making substantial 
use of precise definitions and completeness of the real-number system 
or, what amounts to the same thing, consequences of these things. Since 
we do not now wish to be responsible for furnishing a complete treatment 
of areas of sectors, we shall base our work on the inequality (7.12). This 
inequality, the truth of which should seem thoroughly reasonable to a 
person at B who has the choice of walking two paths to D, has a virtue. 
It has been proved. 

It was indicated in Problem 12 at the end of Section 3.2 that we will be 
able to derive the formulas for derivatives of trigonometric functions 
when we have proved the two formulas 


Figure 8.13 


L 


(8.14) lim 228 1, jim L998? o, 


8—0 8—0 


Like a man who has drilled a hole and filled it with blasting powder to 
split a rock, we are ready to produce results. Figure 7.11 can be put 
Figure 8.141 into a coordinate system as in Figure 
(x3) 8.141, and we supposethat0 < 0 < w/2. 

> In terms of the notation of this figure, 
the inequality (7.12) becomes 


yss <sy+(r-—x). 
Dividing by r gives 
k , 
(r—x) (8.142) sinð < 6 < sin + (1 — cos 0). 
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Since 1 — cos 0 > 0 and 1 + cos 0 > 1 and sin 6 < 6, the last term can 
be overestimated by the inequality 


(8.143) 0 < (1 — cos 6) < (1 — cos 6)(1 + cos 0) 


= 1 — cos? 0 = sin? 0 < 9 sin 9. 


Hence we can replace the last term in (8.142) by @ sin 0 and divide by 
sin § to obtain the first and hence the second of the significant inequalities 


0 1 sin 0 
< _ + —___ — 


Dividing (8.143) by 0 and using (8.142) gives 


(8.145) 0< += cos 8 < sin 0 <8. 


The above inequalities have been proved to hold when 0 < 0 < 72/2. 
Since |6| = 6 when 0 > 0, (8.144) and (8.145) imply that the inequalities 


1 sin ð 
i+] <3 


(8.146) <1, 1 — cos 6 


hold when 0 < 6 < 2/2. Since the members of these inequalities are not 
changed when we replace 0 by —@, we conclude that they hold when 
a] < 2/2 and 0 = 0. The desired formulas (8.14) follow from this and 
the sandwich theorem. 

To derive the formulas for derivatives of sines and cosines, we use the 
formulas 


sin (x + Ax) = sin x cos Ax + cos x sin Ax 
cos (x + Ax) = cos x cos Ax — sin x sin Ax 


to obtain 
sin (x + Ax) — sin x = — sin x(1 — cos Ax) + cos x sin Ax 
cos (x + Ax) — cos x = — cos x (1 ~ cos Ax) — sin x sin Ax. 


After dividing by Ax, taking limits as Ax — 0 of the resulting difference 
quotients gives, with the aid of (8.14), the fundamental formulas 


(8.15) a sin x = cos x, a cos x = — sin x. 


dx dx 
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These formulas, which we have used many times, have at long last been 
proved. For values of x for which the functions are defined, we obtain 


’ 2 m2 
(8.151) Z tanx = 2 Sin* _ costa + sin’ x 


2 
sec? x 
ax dx cos x cos? x 
d d cos x — cos? x — sin? x 
(8.152) cot x = = i = ——_,—_ = — cse* x 
d dx sin x sin? x 
d 1 sin x 
(8.153) —- sec x = = —— = —y— = secx tanx 
dx dx cosx cos? x 
d d 1 — cos x 
(8.154) = csc x = = —— = —3— = — csc x cote. 
dx dx sin x sin? x 


The graphs of sin x and cos x are so important that we reproduce Fig- 
ure 1.58 in Figure 8.16 and give further attention to the procedure by 


Figure 8.16 


which reasonably accurate graphs are quickly sketched with or without 
use of graph paper. The trick is to sketch guide lines 1 unit above and 
l unit below the x axis, to hop 3 units and a bit more to the right of the 
origin to mark r, to make another such hop to mark 2r, and then sketch 
reasonably good copies of the figure. Each graph has slope 1 or — 1 where 
it crosses an axis, and noticeable contradictions of this fact should not 
appear. In the problems at the end of the next section, we shall obtain 
formulas from which sin @ and cos @ can, for a given 0, be calculated as 
accurately as we wish. Meanwhile, we can be interested in Figure 8.17, 
which enables us to obtain reasonable estimates of sin 0 and cos 0 when 
0 <0 <xr/2. The circle has radius 1, and the radial lines make angles 
0.1, 0.2, -- +, 1.5 with the positive x axis. For example, the rough 
approximations 


sin 0.35 = oat = 0.34, cos 0.35 = 024 = 0.94, 
0.34 


tan 0.35 = 0904 = 0.36 


can be read from the figure. 
With the aid of information about derivatives, it is easy to see that 
the graph of the function ¢ for which ¢(x) = tan x has the form shown in 
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Figure 8.17 Figure 8.171 


Figure 8.171. For each integer n, the interval |x — nr| < 1/2 contains 
an exact copy of the graph in the interval |x| < r/2. The lines 


x=nrt 


TE 


are all vertical asymptotes of the graph, and tan (nr + r/2) is undefined. 

We should now be quite familiar with the fact that each formula for 
a derivative has a chain extension. The following list contains the chain 
extensions of formulas for derivatives of the six trigonometric functions 
and three additional formulas. All of these must be learned. 


d . _ du _ > du 

Jy `P Y= COS u g Ją Ot 4 = ese u Tx 

d du 
g; cos v = o sin wa Jy SEC v = Sec u tan u 
4 eanu = sec? u Z Ê escu = — csc u cot u — 
dx dx dx dx 
d a n1 d „ du d, _ ldu 
dx T” Te dx’ TE dx’ dx 8 u = dx 

Problems 8.19 


1 With all graphs and tables out of sight, make the pretense that it has been 
forgotten whether the derivative with respect to x of sin x is sin x or — sin x or 
cos x or — cos x. Sketch graphs of sin x and cos x and make these graphs give 


the correct answer. 
2 Explain how you should modify the procedure for sketching the graph of 
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y = sin x to obtain a procedure for sketching the graph of y = 3 sin x, and sketch 


the graph. 


3 Equate the derivatives of the two members of the identity in the left 
column and show that obvious simplifications give the identity in the right column 


when 
(a) sin 2x = 2 sin x cos x cos 2x = cos? x — sin? x 
(b) cos 2x = cos? x — sin? x sin 2x = 2 sin x cos x 

(c) sin? x + cos? x = 1 0=0 
(d) sin (x + $) cos (x + ¢) 

= sin x cos $ + cos x sin @ = cos x cos @ — sin x sin @ 
(e) cos (x + ¢) sin (x + ¢) 

= cos x cos $ — sin x sin @ = sin x cos ¢ + cos x sin @ 
(P cos? x = =E 08 7 2 sin x cos x = sin 2x 

(g) sin? x = Leota 2 sin x cos x = sin 2x 
(h) sin (« + 5) = COS x cos (« + 3) = —sin x 

(i) eže? = e?z etext = ett 

2x 2 
; 2 — Z -2 
(7) log x? = 2 log x = 


4 Supposing that 0 S$ x < 7/2, let f(x) = tan x and show that 


f'(x) = sec? x 
f” (x) = 2 sec? x tan xe 
f” (æ) = 2 seci x + 4 sec? x tan* %3 


and obtain the next derivative. Show that each derivative of higher order will 
also be a sum of terms of the form 4 sec? x tan’ x, where Æ, p, q are non- 
negative integers, and hence that f™(x) 2 O for each n and x. 

§ Supposing that 0 < x < 7/2, calculate the first three derivatives of sec x. 
Show that these and all derivatives of higher order are nonnegative. 

6 Again supposing that0 < x < 2/2, calculate some derivatives of ~/tan x 
and try to decide whether they are all nonnegative. . 

7 Prove the formula o; 


- 


rigi = sec O — tan +O +6 
by differentiating the right side. 

8 The length L of the longest beam that can be taken in a horizontal position 
around the corner of Figure 8.191 is the length L of the 
shortest line segment placed like the one in the figure. 
Find L. Ans.: L = (a? + 532)34, Remark: Putting the 
equation in the form a% + b% = L% can make us 
wonder why (a,b) should, for a fixed L, be a point on a 


hypocycloid. 


Figure 8.191 
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9 Find the length L of the shortest ladder that can 
reach from level ground to a high wall when it must go over 
a fence which is a feet high and b feet from the wall. 
Ans.: L = (a% + b%)%, 

10 A heavy body is suspended from a rope, as in Figure 
8.192, that runs up from the body to a pulley at E and 
thence (a wonderful word) over two stationary pulleys at Figure 8.192 
A and B on the same horizontal level and back to E, where 
the pulley is tied to the other end of the rope. How should old man gravity 
select the @ of the figure in order to gratify his desire to bring the heavy body to 
its lowest possible position? Ans.: cos 6 = $ and 0 = 7/3 or 0 = 60°. 

11 If |P,| is the length of a regular polygon of n sides which is inscribed in a 


circle of radius a, prove that |P,| = 2an sin (t/n) and hence that lim [Pal = 2ra. 
n—> © 


12 Supposing that 4 and B are constants not both 0 and that 


y(x) = A sin x + B cos x, 


calculate y’(~) = y’’(x) and show that 


y” (x) = —y(x). 


In terms of the graph of y(x), tell precisely what it means to say that y” (x) < 0 
when y(x) > 0 and y’’(x) > 0 when y(x) < 0. Sketch graphs and verify your 
conclusion when 4 = 0 and when B = 0. It is now required that we learn a 
little trick that happens to be very important. Plot the point having coordi- 
nates (4,B) and, as in Figure 8.193, let @ be an angle having its terminal side on 
the line running from the origin to the point. Let C = 4/4? + B? and observe 
that 
A =C cos > and B =Csin ġ 
so 
y(x) = C(sin x cos ¢ + cos x sin œ) 


and hence 
y(x) = C sin (x + @). Figure 8.193 


Finally, it is required that we learn some technical terminology by which this result 
can be remembered. Functions of the form £ sin (wt + ¢) and E cos (wi + ¢) 
are called sinusoids (things like sines) of angular frequency w. Since x can be ot, 
we have proved that the sum of two sinusoids having the same frequency w 15 also 
a sinusoid having frequency w. 

13 It is very easy to show that if k, 4, B are constants for which k > 0 and 


if 

(1) y = A sin kt + B cos kt 
then 

(2) 3 + k?y = 


All we need to do is differentiate (1) twice and look at the result. The theory 
of differential equations contains theorems which imply that if y is a function 
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for which (2) holds, then there must be constants A and B for which (1) holds. The 
latter and more difficult result is often needed by students who have not yet 
made reliable contacts with differential equations. Our solution of the more 
difficult problem will be postponed until we have solved the easy problem by a 
new method which involves only reversible steps. Starting with the formulas 


(3) dy = Ak cos kt — Bk sin kt, 


y = Asin kt + B cos kt 
(à 


we eliminate B, and then eliminate 4, from these equations to obtain 


cos a z+ (k sin kt)y = Ak 


(4) b 
sin kt- — — (k cos kt)y = — Bk. 
Therefore, 
z [cos He S kt = + (k sin ty | =0 
6) d i 
5, | sin be — (k cos ky | = 0 
or 
d?y 
cos kt => Prin k sin ee 2 = + k sin ae + (k? cos kt)y = 0 
(6) d?y D 
sin kt -ya + k cos aS — k cos u? + (k? sin kt)y = 0- 
or 
2 
cos kt k + Hy] =0 
(7) 


. d 
sin kt E + By | = 0. 
Since there is no ¢ for which cos kt and sin kt are both 0, we conclude that 
d* 
(8) Fa t Ry = 0. 


We are now ready to prove the more difficult result. Suppose that y is a given 
function for which (8) holds. Then we can multiply by cos kt and sin kt to 
obtain (7) and hence (6) and hence (5). Since the derivatives of the quantities 
in brackets are zero, these quantities must be constants which we can call 4k 
and —Bk to obtain (4). Solving the equations (4) for y and dy/dt gives (3) 
and our problem is solved. 


14 Itis very easy to show that if k, 4, B are constants for which k > 0 and if 


(1) y = Ac + Bet, 
then 
(2) Z — ky = Q. 


8.1 Trigonometric functions and their derivatives 447 


Do this. Then adapt the method of the preceding problem to show that if y 
is a function for which (2) holds, then there must be constants 4 and B for which 
(1) holds. 

15 Prove that if y and u are functions of t having second derivatives and if 


= ght 
yore u, 


then 


a yy 2,2 yt hy = ou | aa + (A — Wya |, 


16 À right circular cylinder (like a tomato can from which the top and bottom 
have been removed) has height A and base radius a. We examine the idea that 
good approximations to the area of this surface are obtained by triangulating it 
and calculating the sum of the areas of the triangles. The surface is partitioned 
into m strips each having height h/m like the 
one shown in Figure 8.194. On the top and 
bottom circular boundaries of each strip, we 
place the vertices of 2n isosceles triangles con- 
gruent to the triangle ABC of the figure. The 
side AB of the triangle subtends the angle 2r/n 
at the center O of the top circle. Of the 2n 
triangles in the top strip, have two vertices 
on the top circle and the other n have two 
vertices on the lower circle. The total number 
of triangles congruent to the triangle ABC is 
2mn. ‘The first factor in the product 


(eant) (EJ + (e-em) 


is half the length of the base 4B of the triangle. Figure 8.194 
The last factor is the altitude, being the length 

of the hypotenuse of a triangle one leg of which runs from C to the right angle 
on the upper circle and the other leg of which runs (in the direction of O) from the 
right angle to the base 4B. The sum Smr of the areas of all of the triangles is 
the product of (1) and 2mn. Therefore, 


(2) Sma = 


When m and n are both large, the quotients having sin (m/n) in their numerators 
are near 1 and we have the approximation 


(3) Smn ~ 2rah q j1 + (=) (3): 
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In case m = n, and in other cases where 


. m 
we have 
(5) lim Sn = 2rah, 
m,n— œ% 


the right member being the number usually considered to be the correct area of 
the cylinder. In case m = n?, we obtain 


2, wi Ne 
(6) lim Sma = 2eah J + (= 2) 


m,n—> @ 


the right member being a number which is not usually considered to be the correct 
area of the cylinder. Other remarks can be made. The above calculations were 
made by a German mathematician Schwarz, and they constitute the Schwarz 
paradox. The paradox shows that the triangulation idea provides a precarious 
basis for definitions of area of curved surfaces. The theory of these areas is 
extremely complicated. 

17 Itis sometimes useful as well as interesting to have information about the 
things we see. A polynomial P in x and y is the sum of a finite set of terms of the 
form cx?y, where c is a constant and 7 and k are nonnegative integers. The poly- 
nomial has degree n if j + k = n for at least one term in the sum having a non- 
zero coefficient and j + k S n for each term in the sum having a nonzero coeff- 
cient. A polynomial in which the coefficients are all zero is said to be #trigial; 
it does not have a degree. Thus the polynomials having values 


(1) xt + xy + yt — 34, (y? lee + y? — 4) 


both have degree 4. A nontrivial polynomial is irreducible if it is not the product 
of two polynomials of lower degree. An algebraic equation of degree n is an equa- 
tion of the form P(x,y) = 0, where P is a nontrivial polynomial of degree in x 
and y. The graph of an algebraic equation is an algebraic graph. A function f 
of one variable x is said to be an algebraic function if there is a nontrivial poly- 
nomial P(x,y) such that 


(2) P(x, f(x)) = 0 


for each x in the domain of f. Since each nontrivial polynomial in x and y can 
be put in the form 


(3) Qo(x) + Qilx)y + O2(x)y? + > + + On(x)y", 


where Qo, Qi, - © © , Qn are polynomials in x at least one of which is nontrivial, 
it follows that a function f is an algebraic function if and only if there exist poly- 
nomials Qo, Qı, - © © , Qa in x such that Q, is nontrivial and 


(4) Oolx) + OAE) + ODEN + - + + + On(%) UI" = 0 


for each x in the domain of f. Functions that are not algebraic functions are 
said to be transcendental functions, the old idea being that polynomials lie at the 
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foundation of human experience and that things not closely related to polynomials 
are more ethereal. This matter can be of interest to us now because, even in 
quite elementary mathematics, the trigonometric functions are called trans- 
cendental functions. For example, the assertion that sin x is transcendental 
means that there do not exist polynomials Qo, Qı, © © © , Qa in x such that Qn 
is nontrivial and (4) is true for each x when f(x) = sin x. Let us now think 
briefly about real numbers. A number x is said to be an algebraic number if there 
exist integers ko, kı, ke, © © © , kn not all zero such that 


ko + kix + kx? + ++ > + kax” = O. 


Thus a number x is algebraic if it is a zero of a nontrivial polynomial in x having 
integer coefficients. A number x which is not an algebraic number is said to be 
a transcendental number. The numbers m and e are transcendental. Nobody 
can prove these things unless he devotes very much time and energy to the opera- 
tion, but nobody requires us to be so busy digging ditches that we never look at 
the stars. We can know that there exist theories of algebraic functions and 
algebraic numbers and that these theories invite the attention of persons who 
have completed studies of analytic geometry and calculus. 


8.2 Trigonometric integrands Before introducing chain extensions 
and other modifications of the formulas, we systematically work out 
formulas for integrals of the six trigonometric functions. Even though 
the last three or four have minor importance, we must learn about them 
to be respectable. The first two are 


(8.21) f sinx dx = — cos x + c, f cos x dx = sin x +c. 


They are immediate consequences of the formulas for derivatives of 
sines and cosines, since the formula 


Sf(*) dx = F(x) + c 


is valid over an interval if and only if F'(x) = f(x) over the interval. 
It is necessary to keep negative signs in their places when we differentiate 
and integrate sines and cosines, and it happens that a foolish little trick 
enables us to permanently remember how the signs go. We can men- 
tally write 


(8.22) | derivative integral 
sine cosine 


in their natural orders and remember that “like things give plus,” so 
differentiating sines and integrating cosines give plus signs, but that 
“unlike things give minus,” so differentiating cosines and integrating 
sines give minus signs. 

If an interval contains one of the points nr + 7/2, there can be no F 
such that f tan x dx = F(x) + c over that interval. When x is confined 
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to an interval containing none of these points, we can use the funda- 
mental formula 


| [u(x] tu (x) dx = log |u(x)| + c 


to obtain 


(8.23) | onzas = — | l (— sin x) dx = — log |cos x| + c 
cos x 
= log jsec x| + c. 


In most applications of this formula, |x| < r/2, so cos x > O and the 
absolute-value signs can be omitted. For cot x, a miniature graph of 
which appears in Figure 8.232, we suppose that x is confined to an inter- 
val containing none of the points x = nm and obtain 


(8.231) f co x dx = | 1 cos x dx = log |sin x| + c. 
sin x 


In most applications of this formula, 0 < x < r, so sin x > O and the 
absolute-value signs can be omitted. 
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Figure 8.232 Figure 8.233 Figure 8.234 


A sketch of the graph of y = sec x is most easily obtained by sketching 
a graph of y = cos x and estimating reciprocals. As the graph in Figure 
8.233 indicates, sec x is undefined when x is an odd integer multiple of 
aw/2. When we integrate sec x over an interval, we must suppose that 


the interval contains none of these points. We can then obtain the 
formula 


(8.235) J = X dx = | | (sec? x + sec x tan x) dx 


sec x + tan x 
log |sec x + tan x| +c 


I 


provided we happen to know that the result is obtained by multiplying 
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the numerator and denominator of (sec x)/1 by the implausible factor 
sec x + tan x. We dispose of csc x as rapidly as possible by writing 


(8.236) | = f dx = — | a (—csc? « — csc x cot x) dx 


csc x + cot x 
— log |csc x + cot x| + c. 


When 0 < x <r as in Figure 8.234, we can omit the absolute-value 
signs. The chain extensions of the six formulas are placed in a table at 
the end of this section where they are most available for reference. The 
formulas 


(8.24) | sin ax dx = F sin ax a dx 


(8.241) | cos ax dx = Ji cos ax a dx 


1 
— 700s ax + ce 


1. ` 
7 Sin ax + c, 


which hold when a ¥ 0, are by far the most important applications of 
the chain extensions. 

We now consider some of the more or less important integrals that can 
be evaluated in terms of elementary functions. Adding and subtracting 
the two elementary formulas 


(8.25) cos? 0 + sin? 6 = 1 

(8.251) cos? 6 — sin? 0 = cos 20 

gives the two formulas 

(8.252) cos? @ = Lt cos 2, sin? §@ = L= cos 76 


that are so useful that we should either learn them or be able to work them 
out very quickly. Moreover, these formulas should come to mind when 
we are asked to evaluate the first integrals in the formulas 


(8.253) | cos? ax dx = | LE 608 208 ay 


= AE + >. | cos 2ax(2a) ax | 
1 
2 


[e+ sin 2as | +c 


2a 
and 
(8.254) Jan ax dx = [= dx 


E -> | cos 2ax(2a) ds | 


E — 1 sin 2a | +c. 
2a 
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Students are frequently called upon to realize that the formula 


1 + cos N = Lice 2x + cos? 2x 
Ta o G 


cost x = (cos? x)? = ( 


is useful when the left member must be integrated, and to know what to 
do to complete the problem. 
The integral in 


(8.26) I = f sin mx cos nx dx 


looks forbidding until it is discovered or remembered that a useful 
formula for the integrand can be obtained by adding the formulas 


sin (0 + ¢) = sin 6 cos ¢ + cos 0 sin ¢ 
sin (0 — $) = sin 6 cos ¢ — cos @ sin ¢. 
Thus 
Hsin (mx + nx) + sin (mx — nx)] = sin mx cos nx 
and 


1 . 
(8.261) I= mew | [sin (m + n)x](m + n) dx 
1 . 
+ Im — n) J [sin (m — n)x|(m — n) dx 


= E Ew 7”) cos (m + n)x — X 


z) cos (m — n)x +e 


m — 
except when m = n or m = —n. Similarly, the integrals 
(8.262) f sin mx sin nx dx, f cos mx cos nx dx 


can be evaluated by use of formulas obtained by adding and subtracting 
the formulas 


cos (0 — ¢) = cos 6 cos @ + sin 9 sin ¢ 
cos (0 + @) = cos 0 cos @ — sin @ sin @. 


Sometimes integrals can be evaluated by making quite direct use of 
the power formula and other integration formulas. For example, 


(8.263) | sin? ax- cosas dx 


`np+l 
= Al (sin ax)?(a cos ax) dx = 1 sin?t! ax +c 


a p+li 


when a Æ 0 and p # —1. Sometimes we need some ingenuity. For 
example, the integral 


(8.264) f sin? x cos? x dx 
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is rendered manageable by replacing the factor cos? x by the last member 
of the formuia 


cos? x = cos? x cos x = (1 — sin? x) cos x. 
The integral in 
(8.265) fem? sin x dx = — fez( — sin x) dx = — gcse + c 


is evaluated by making the adjustment and compensation necessary to 
put it in the form fe*)x’(x%) dx, and opportunities to make such adjust- 
ments and compensations should always be observed. 

There are reasons why some of the integrals we have evaluated may 
be said to be so important that everyone should know how to evaluate 
them. There is an old and perhaps honorable tradition that requires 
students of the calculus to spend huge amounts of time cultivating “the 
technique” of “formal integration.” The fact that we live in an age of 
electronic computers makes it much more important to learn fundamental 
theory than to acquire skill in formal integration. For this reason, the 
author requests that teachers join him in avoiding all but the simplest 
formal integration problems that are not likely to be encountered by 
undergraduates in courses other than calculus courses. ‘The student” 
who has not read dozens of calculus books and does not know what we 
are talking about is invited to look at the shiny example 


(8.27) I = f Vsin x tan «(1 + ~/cos x) dx 

of an integral that we shall not expect him to evaluate quickly. A 
person who has constructed this problem can easily feel very sure that 
the only sensible attack upon the problem lies in setting 


(8.271) u(x) = 1 + (cos x); 


so, when 0 < x < 7/2, 


(8.272) u'(x) = $(cos x)—4(— sin x) 

= -4\—— = —}$Vsinxtanx 
and hence 
(8.273) I= -—2 J (u(x) lu (x)] dx 


= -7 BON yen —$[1 + V cos x]* + c. 
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Instead of inviting attention to problems of this nature, we present prob- 
lems more likely to promote scientific competence. 


Table 8.28 
_ au du . 
f sin uaz = — cos ute [ cos ut ax = sinue 
dx dx 
d d 
J tan u ax = log lsc ul + c f cot u dx = log [sin ul +c 
dx dx 
du du 
J sec uF dx = log |sec u + tan ul + c J csc uw dæ = log lese u — cot u| + c 


urt du 
[rin ote [ -2 i= hoga +e, f ee desete 


Problems 8.29 
I Make all of the calculations necessary to show that 
r x/2 
(2) h sin x dx = 2 (b) l cos x dx = 1 
b b 
(c) lim Í sin wx dx = 0 (d) lim Í cos wx dx = 0 
> 00 a w—> 0 a 
1 fz l 
(e) lim -f sin? wt dt = $ (f) lim - ” cos? wi dt = $ 
Z> o x 0 T- © x 
x/4 
o h tan x dx = % log 2 iy fo” tan x dx = 0 


2 Recall that, when x is a differentiable function of x and u =Æ 0, 


d 1 du 
(1) 7, 108 lul = 5 3 
the absolute-value signs, which are superfluous when u > 0, need not bother us. 
Supposing that x is not an odd multiple of r/2 and that 


un (5+ 9) 


(2) f(x) = log |sec x + tan xl, g(x) = log 


show that f'(x) = sec x and g'(x) = sec x. 


Remark: This proves that the two formulas 
Æ) f sec x dx = log |sec x + tan x| + cı 


At) [ sec x dx = log tan (5 +7)|+e 


are both correct. Some integral tables contain both of them. These things 
imply that, over each interval containing no odd multiple of w/2, f(x) — g(x) 
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must be a constant, but they do not imply that this constant must be zero. The 
identity 


1+tan= cos = Z 4 sin cos? + sin Ž 
x” r 2 2 2 
tan (3+3 = - —__i 3 
1 — tanz cos 5 sin 5 Cos 5 + sin 5 
7 x oo- - 
1 + 2 sin 5 cos 5 1+sin x 


= = = secx + tan x 


clarifies the matter. 
3 Prove that if x is not an even multiple of 7/2 and 


x 
tan =) 


f(x) = log |csc x — cot x|, g(x) = log 7 


then f'(x) = csc x and g'(x) = cscx. Remark: This proves that the two formulas 


J csc x de = log lese x — cot x| + ¢, J cscs ds = log +c 


tan 5 


are both correct. 

4 When a steady (or constant) current J (measured in amperes) flows 
through a wire (or resistor) having resistance R (measured in ohms), the quantity 
Q (measured in watthours, or thousandths of kilowatthours) of energy converted 
into heat in Az hours is calculated from the formula 


(1) Q = PR At. 


With this basic information, show that if J(¢) is integrable (Riemann) over the 
time interval ¢; < ¢ S tz then the quantity Q(tı,t2) of energy converged i into heat 
between times żı and tz is (or should be) 


t2 
(2) Ott) =R fy UOR ae 
Suppose now that Z(t) is the sinusoidal (or alternating) current determined by 
(3) I(t) = Io sin (wt + 9), 


°. o . 93 
where J is a constant “maximum current,” w is a constant “angular frequency, 
and ¢ is a “phase angle.” Show that, in this case, 


(4) QO(ti,te) = aay cR — ah = hin (2wte + 26) — sin (2wti + 2¢)]. 


The last term is 0 whenever 2wto — 2w#; is an integer multiple of 27, and in every 
case the absolute value of the last term cannot exceed RJ$/2w. In all ordinary 
applications of this formula, te — t, is so large in comparison to 1 /w that the last 
term in (4) is insignificant. In such cases, (4) is always replaced by 


(5) Q(ty,t2.) = FIGR(t2 — t1). 
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Putting (5) in the form 


(6) O(ti,te) = (<5) R(tz — tı) 


shows that a sinusoidal current having maximum value Jy) produces heat (or 
dissipates energy) just as rapidly as a steady current of magnitude /o/ /2. For 
this reason, the number J)/+/2 is called the effective value of the sinusoidal current. 
For ordinary house current having “effective voltage” 120 volts, the maximum 
(or peak) voltage is not 120 volts but is 120 a/2 volts. 

- § Let, when Lis a positive number and n = 1, 2, 3, ++, 


n(x) = NE sin To Yo(x) = 7 Phl) = vi cos a 


Show that the first of the formulas 
L L 
fi baC)dalx) dx = 0, fy da(u)}* de = 1 


holds when m and n are different positive integers and that the second holds 
when n = 1, 2,3, ---. Show that the first of the formulas 


[i vndvale) de = 0, fy Wale)]* dx = 1 


holds when m and n are different nonnegative integers and that the second holds 
when n = 0, 1,2,- --. Show that 


[vole bale) de = VISE fy = 1,2,3,-- 9, 


nT 


Show that, when m, n = 1,2,3,-°-, 


L — 1— — 
f Wnlx)oa(x) dx = 1 — cos (m + n) , 1 — cos (m = nt 


(m + n)r (m — nr 


where the last term is to be omitted when m = n. Remark: While students in 
calculus courses have not yet heard about the matter, the above formulas are of 
great interest in the theory of orthonormal sets and Fourier series of the trigo- 
nometric variety. Our calculations and a little theory produce the interesting 
formulas 


T Z sin TZ p L gin 
477z 3 Sin | 


Sax 


Hsin yee. O <r <D 
T? 1 1 1 r? 1 1 1 
gritgtatzyat cc. GeltpmtRtet 


and many others. 
6 A problem in Section 8.4 will tell us about the formula 


x/2 
I sin? 0 cos? 0 d§ = 


8.2 Trigonometric integrands 457 


which is correct whenever p > —1 and g > —1. As usual, 0! = 1, 1! = 1, 
2! = 1:2, 3! = 1:2:3, 4! = 1:2°3-4, etcetera. The values of x! when x is not an 
integer are more esoteric, but x! > 0 when x > —1. Put p = g = Q and dis- 
cover that (—4)! = Jz. Show that the formula is correct when p = q = 1. 


Put p = q = 2 and discover that (5)! = Wr/2. Put p = 2x — l andqg=1 
and use the result to prove that x! is the product of x and (x — 1)!. 
7 With an assist from (8.144), which shows that 


1 1 1 /sin 0N? 1 
—— J9 < 
J arat s j (T) as f, 1%, 


the middle integral being a Riemann-Cauchy integral because the integrand is 
undefined when @ = 0, show that 


1 1 /sin 0N? 
ssf ( ) si. 
0 0 


Then prove the first of the inequalities 


© : 6\2 
os [Sas isf (Sas 


and use it to obtain the second. 
8 Let a and b be constants for which a = 0 and b = 1 ora = l and b = 0 
and let x > 0. Show that 


—lSacosx+bdsinx 21. 
Replace x by ż and integrate over the interval 0 < ¢ £ x to obtain 
—x Sasinx — b(cos x — 1) Sx. 
Replace x by ¢ and integrate over the interval 0 £ ¢ £ x to obtain 


x2 
-5 £ —a(cos x — 1) — b(sin x — x) $ F 


Repeat the process to obtain 


2 3 

-5 s- —a(sin x — 2) + b( cose -1+5 <5 

x4 , x4 
-2 <a(cos#—1+ 5) +è (sin a — eth) si 
-E s a(sine —2 +5) -t (coss 1+3- asf 


Repeat the process two more times. With or without more attention to details, 
jump to the conclusion that 


, x3 x5 xl yant at 
jeer Gaston +o Sail E ae Eo 
x? x xen _|aite*? 
osx- (1-F+F- ait: +3) |saiem 
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for each n = 1, 2, 3, ---. It is true (and is easy to surmise) that, for each x, 
[x|"/n!—+Oasn— æ. Therefore, 


. xs x x 
sin x = x — 3] 


3! 7! 
cosx=1—7 41 6! 


9 The formulas at the end of the preceding problem have been proved to 
be correct when x > 0. Use this fact to prove that the formulas are also correct 
when x $0. Hint: Look first at the case in which x = 0. Then use the facts 
that sin (—x) = — sin x and cos (—x) = cos x. 

10 Havea good look at the formulas at the end of Problem 8 and start learning 
them by obtaining some of the following results correct to four or more decimal 
places. 


sin 0.01 = 0.00999 9833 
sin 0.02 = 0.01999 8667 
sin 0.10 = 0.09983 3417 
sin 0.20 = 0.19866 9331 
sin 1.00 = 0.84147 0985 
sin 1.10 = 0.89120 7360 


cos 0.01 = 0.99995 0000 
cos 0.02 = 0.99980 0007 
cos 0.10 = 0.995C0 4165 
cos 0.20 = 0.98006 6587 
cos 1.00 = 0.54030 2306 
cos 1.10 = 0.45359 6121 


11 Digest the following idea. We have a desk calculator and National 
Bureau of Standards Tables giving the values 


sin 0.2345 = 0.23235 6699 
cos 0.2345 = 0.97263 0641 


If we want to find sin 0.23456 789 correct to eight decimal places, we can use 
the identity 


sin (x + 0.2345) = sin x cos 0.2345 + cos x sin 0.2345, 


where x = 0.00006789. The values of sin x and cos x can easily be found correct 
to 10 decimal places from the formulas at the end of Problem 8. 


8.3 Inverse trigonometric functions Before coming to the an- 
nounced subject of this section, we think about a general situation appli- 
cations of which appear in many branches of mathematics. Suppose 
we have an operator or transformer or mapper or function f that carries 
or transforms or maps each element x of a set D (the domain of f) into 
an element y of a set R (the range of f). In some cases the transformer 
transforms two or more different elements of -D into the same element 
of R. For example, if f is one of the six trigonometric functions and 
f(x) exists, then f(x + 2r) = f(x) and hence more than one element of 
the domain is carried into the same element f(x) of the range. When 
problems involving domains and ranges are involved, it is very often 
possible to eliminate confusion by singling out for special attention a 
subset D, of D such that to each y in R there corresponds exactly one x 
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in Dı for which f(x) = y. Then we can introduce a function fı, which 
is different from f because it has a different domain, for which f1(x) = f(x) 
when x is in Dı and for which f(x) is undefined when x is notin Dı. This 
function fı is called the restriction of f tò Dy. We now have a general 
situation that may be easier to visualize than its applications to trigono- 
metric functions. The schematic Figure 8.31 shows the domain and 


x x+Ax x 
Figure 8.31 Figure 8.32 


range of fı, and the upper arc from x to y can make us think of a path 
along which fı might carry x to the y which is f,(x). The function fi 
sets up a one-to-one (or schlicht) correspondence between D, and R, 
that is, to each x in Dı corresponds exactly one y in R for which f(x) = y 
and to each y in R corresponds exactly one x in D, for which f(x) = y. 
The function with domain R and range D, which carries each y in R 
into the x in D, for which fı(x) = y is called the inverse of fı and is denoted 
by fy’. As indicated by the figure, fī! undoes what fidoes. Iffi(~) = y, 
then x = fr*(y). Moreover, fy*(fi(~)) = x when x is in D and 


ATO) =y 
when y is in R. 


Figure 8.32 shows the graph of a particular function f to which the 
following theorem applies. 

Theorem 8.33 If f is continuous over an interval D, in E, and if f has a 
derivative for which f'(x) > O at each inner point x of D, [or f'(x) < 0 at 
each inner point x of Dı) then the restriction fı of f to D, has a range R 
which is an interval and has also an inverse fī which is differentiable at 
each inner point y of R. Moreover, the first of the formulas 


(8.331) fi") = IEO F = i 
dx 


is valid when y is an inner point of R. 

It seems to be possible to put the first formula of (8.331) in the second 
form without sacrifice of meaning. To prove the theorem, we use the 
hypotheses to conclude that fı is increasing (or decreasing) over Dı and 
that (because of the intermediate-value theorem) R is an interval. Hence 
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fı has an inverse defined over R. Since we must prove that fr is differ- 
entiable, we introduce difference quotients for fy’. Supposing that y 
and y + Ay belong to the domain R of fi’, we define x and x + Ax by 
the formulas x = f;(y) and x + Ax = fi (y + Ay) and obtain y = f(x), 
y + Ay = fi(x + Ax). Since fı has a positive (or negative) derivative, 
we conclude that Ax + 0 when Ay #0 and that Ax—0 as Ay— 0. 
Therefore, 


n filly + Ay) — fry) L y Ax 
(8.352) lim Ay Dm FG F as) — TO 
. 1 
= hm F@ + Ax) ON ie = AOON 
Ax 


This proves Theorem 8.33. Of course it is possible to abbreviate the 
calculations by writing 


dx Ax 1 1 
8.333 fX = lim Ž = —— =, 
(8-333) dy ayr0 Ay lim SY B 


but this one line is not, by itself, the equivalent of a theorem and proof 
which present conditions under which the formula is correct. 

Our general discussion of inverse functions and Theorem 8.33 will now 
be used to guide us to six functions that are called inverse trigonometric 
functions even though they are in fact inverses of restrictions of trigono- 
metric functions. It will turn out that the six functions will all have 
values between 0 and 7/2, and that all of the information implied by 
Figures 8.341, 8.342, and 8.343 will be correct, provided 0 < x < 1 in 


Q x [L x [L 
cE} EX mo 
— | — | A) 

vi-x? 1 I 


Figure 8.341 Figure 8.342 Figure 8.343 


Figure 8.341, x > 0 in Figure 8.342, and x > 1 in Figure 8.343. The 


first figure shows, for example, that 


x 
Vi-x 


when 0 <x <1. These triangles can sometimes provide helpful infor- 
mation even when the functions appear in problems involving the mis- 
fortunes of gamblers who bet on horse races. 

To begin, let f be the trigonometric function for which f(x) = sin x, 
the domain D being the infinite interval — œ < x < œ and the range 


sin (sin! x) = x, tan (sin! x) = 
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R being the closed interval —1 | y $1. When we are called upon to 
select a domain Dı to which we can apply Theorem 8.33, the graph of 
y = sin x can remind us of known properties of its derivative which show 
us that if we are going to be able to apply Theorem 8.33, we must let 
D, be the interval —x/2 < x < 1/2 or some other interval lying nr 
(n an integer) units to the right or left of it. Making the simplest choice, 
we let fı be the function, defined only when —r/2 < x < 1/2, for which 
fi(x) = sin x. The inverse of fı is called the inverse sine. Ify = fi(x), 
so that y = sinx and —r/2 S x S7/2 and —1 < y < 1, we write 
x = sin! y. The graph of x = sin“! y coincides with the part of the 
graph of y = sin x for which —r/2 < x < r/2, and the graph is shown in 
Figure 8.351. When we interchange x and y, which amounts to replacing 


y=sin] x 


x=sin y 


3 


Figure 8.351 Figure 8.352 


each point (x,y) by its “image” (y,x) in the line y = x, we find that the 
graph of the equation y = sin™! x is the same as the part of the graph 
of x = sin y for which —r/2 S y S r/2 and that the graph is shown in 
Figure 8.352. When —1 <x < l and y = sin™ x, we have sin y = x, 
—r/2 < y < r/2, and cosy > 0. Since Theorem 8.33, with x and y 
interchanged, implies that dy/dx exists, we can use the chain rule to 
obtain cos y(dy/dx) = 1 and hence 


A _ dy | ee ee ee 
(8.353) Fy Sin == = = 


While we could not be expected to guess the result in this formula, we 
can look at Figure 8.352 and see that it seems to be a very reasonable 
answer. One who feels that inverse sines and their derivatives could 
never be useful should be informed at once that (8.353) gives the formula 


(8.354) x= sin!x +c (\jx| < 1) 


| = 


which is useful because the integral appears in important problems. 


462 Trigonometric functions 


The inverse cosine is the inverse of the function fı, with domain 0 < 
<r and range —1 <x 1, for which fi(x) = cos x. Graphs of 
cos™! y, of y = cos™! x, and of the relevant parts of y = cos x and 
cos y appear in Figures 8.355 and 8.356. When —1 <x <1 and 


x 
x 
x 


Figure 8.355 Figure 8.356 


y = cos™! x, then cos y = x,0 < y < r, and sin y > 0. Again Theorem 
8.33 implies that dy/dx exists, so (—sin y) (dy/dx) = 1 and 


d —1 = sO = ss oooi 
6BD Ge O88 = de ny A oy VI 


This gives the integration formula 


(8.358) | oat = —cos! x + c (læ < 1) 


which, because sin™! x + cos™! x = 2/2, does not contradict (8.354). 

As Figure 8.171 may suggest, the in- 
y=tan x y verse tangent is the inverse of the func- 
x=tan y 
tion fọ, with domain —r/2 < x < 7/2 
and range ~œ <x< æ, for which 
fi(x) = tanx. The graph of y = tan`! x, 
which is the same as a part (sometimes 
Figure 8.36 called the principal part) of the graph of 

x = tan y, is shown in Figure 8.36. The 
domain of the inverse tangent is the entire set of numbers, and if 
y = tan™! x, then tan y = x and sec? y(dy/dx) = 1, so 


d _,._dy_ 1 1 > I 
(8.361) g tn a = = a = E 
This gives the important formula 


(8.362) J — de = tani x 4 c. 
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The inverse cotangent is the inverse of the function fı, with domain 
0<x* <aand range —œ <y < ow. A graph of y = cot! x is shown 
in Figure 8.37. The domain of the inverse cotangent is the entire set of 


Figure 8.37 


real numbers and if y = cot! x, then cot y = x and — csc? y(dy/dx) = 
sO 
qd, _dy_ —l _ -1 —1 

(8.371) de OC S T= Gay = IF eos ~TE 

As Figure 8.233 indicates, the graph of y = sec x presents a difficulty 
that has not previously appeared. It is impossible to select an interval 
D, of values of x such that sec x is defined and continuous over D, and 
each y in the range of sec x is attained for some x in Dı. The best we 
can do is let D, be the interval 0 < x < m with the center point x = 1/2 
omitted. The inverse secant is then 
the inverse of the function fı with 
domain Dı for which fi(«) = sec x. 
The graph of y = sec™! x, which 
coincides with a part of the graph of 
x = sec y, is shown in Figure 8.38. 
Ifx > lorx < —landy = sec! x, Figure 8.38 
then sec y = x,0< y < r, y Æ r/2, 
and sec y tan y > 0. Again Theorem 8.33 implies that dy/dx exists, so 


sec y tan y 2 = =] 
and 


d ,.,_%y_ 1 ç __ l 
(8.381) dx ~*~ ax ~ secytany ee y| [tan y| 


~ jsec sec y| /sec? y — 1 J/ = y — ~ [xl fa? — 1 Ta 


This gives the formula 


(8.382) dx = sec! x + c, 


lars 


which is valid when x > 1 and also when x < —1. 
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The graph of y = csc x presents the same difficulty that the graph of 
y = sec x presented. The best we can do is let Dı be the interval 
—r/2 < x S1/2 with the center point 
x = O omitted. The inverse cosecant 
is then the inverse of the function f, 
with domain D, for which fi(x) = cscx. 
The graph of y = csc— x, which coin- 
cides with a part of the graph of 
Figure 8.383 x = csc y, is shown in Figure 8.383. 
If x > lorx < —1 and y = csc”! x, 
then csc y = x, —r/2 < y < 2/2, y #0, and hence csc y cot y > 0. 
Again Theorem 8.33 implies that dy/dx exists, so 


d 
— csc y cot y 3> = 1 


and 
d = d -1 ool 
(8.384) moo a csc y coty [esc y| |cot yl 
T f Vee i T ial vera T 
Problems 8.39 


1 Supposing that a > 0 and b? < 4ac, show how the first of the formulas 


(1) [ pew unua ina 


a? + u? 
ft un EE 
ax? + bx + ¢ 4ac — b? V tac — mT 
can be used to obtain the other two. Remark: Everyone should know how the 
last of these formulas and many similar ones are presented in standard integral 
tables. Let a > 0 and let 
X =ax?+bxe+e, q = b — 4ac. 
Then (2) takes the form 


1 2 2ax + b 
=~ dx = — tan”! +c 
Jz V —g V —4 


when g < 0 and hence —g > 0. The derivation of (3) is based upon the identity 
b c 
(4) X=alat+ix+ 5] = a[(#+2 x +ga) + G- x l 
Aa] af 
=a| (x + Ż) + ( x+ Z 


(3) 


Ye) | 
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which involves completing a square. It may be time to renew the idea that 
normal persons who have forgotten (or never remembered) needed integral 
formulas either work them out to satisfy their vanities and preserve their 
mathematical powers or stoop to hunting them in a book containing a table of 
integrals. 

2 Show that 


[ote _ = 
0 yet xt ~ By 
when y > 0 and that 


ayy 1 m log a 
f, tf, apex} dy = 4 
when a > 0. 
3 Supposing that —1 <x < land 
(1) T(x) = 23 cos (n cos™! x), 


find formulas for T(x) and Tp (x). Then multiply 7,(x) by n?, multiply T (æ) 
by —x, multiply Ta (x) by (1 — x?), and add the results to discover that 


(2) (l— x) sa * at ny = 


when y = T,(x). Remark: This problem involves much more than meets the 
eye. When n is a positive integer, T, is the Tchebycheff polynomial of degree 
n and (2) is the Tchebycheff differential equation of order n. One who wishes 
a simple and brief discussion of these astonishing things can find it in the author’s 
textbook ‘Differential Equations,” 2d ed, McGraw-Hill Book Company, Inc., 
New York, 1960. 

4 Supposing that —1 <x < l and u = (sin™! x)?, calculate the first two 
derivatives of u and show that 


a — at) 2S = 


5 The eye of a man is at a point E that is, as in Figure 8.391, x feet from a 
vertical wall bearing a picture the bottom and top of which are a and b feet above 


Figure 8.391 


eye level. The angle 0 which the picture subtends at the eye is surely small 
when x is small and when x is large. Derive the formulas 
_ a dð _ (b — a)(ab — x?) 
0 = tan tz tan y dx (a? + x)(b? + x) 


466 Trigonometric functions 


and show that 0 is a maximum when x = Vab. Remark: Addicts of elementary 
plane geometry can be pleased to see that the x which maximizes 6 can be found 
in another way. As in Figure 8.392 let C be a circle which passes through the 


Figure 8.392 


top and bottom of the picture and intersects the eye-level line L at two points 
E, and E». The angles which the picture subtends at Æ, and E; are equal. Of 
all such circles C that intersect L, the smallest one that produces the greatest 6 
is the circle Co for which Æ, and Æ» coincide, so Co is tangent to L. ‘The radius 
of Co is (b + a)/2, and use of an appropriate right triangle shows that the dis- 
tance from the wall to the center and to the critical point of tangency of Cù is 


2 IZN 
(? as 2) _ (? n *) or Gh. 
6 Supposing that a > 0 and |ax| < 1, prove the formula 


at+x 


71 tan~! a = tan`! 
tanl x + a 1— ox 


by showing that the two members of the equation are equal when x = 0 and 
have equal derivatives when |ax| < 1. 
7 With or without undertaking to prove the fundamental fact that cot™! x = 


tan! (1/x) when x > 0, suppose that x = 0 and fill in the omitted steps in the 
calculation 


—1 


1 + x? 


8 With or without undertaking to prove the fundamental fact that sec! x = 


cos~! (1/x) when |x| 2 1, suppose that |x| > 1 and fill in the omitted steps in 
the calculation 


cot™! d t not 
—_-,s x = — a -o = e es e = 
dx dx x 


La — 1 
sec”! x = =~ cos— 
dx dx 


1 
xo lel Vx? = 1 


Hint: Do not forget the chain rule and the fact that x? = |x| |x|. 
9 With the aid of Figure 8.343 we can see that the formula 


2 _ 
(1) sec”! x = sin7! ve = 


is valid at least when x > 1. Differentiate the two members of (1) and determine 
the set of numbers x for which the derivatives are equal. 
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10 Letting N be a positive integer (which we shall not cali “arbitrary’’), 
manufacture and solve N problems of the nature of the preceding one. 
11 Prove that 


[ee T ro d 
ole” ~F >» Irat =T 


12 Remark: We should not be so busy that we never have time to look at our 
formulas and see that we can learn things from them. Supposing for simplicity 
that x > 0, we can look at the formula 


z 1 
—! — a 
(1) tan"! x = o IF dt 


and realize that each member is equal to the area of the shaded region in Figure 
8.393. Any information we can obtain about tan™! x or the area or the integral 


2 ¢ 
Figure 8.393 


therefore provides information about all three. While there are much more 
complicated ways of getting the information, we can quickly learn some sig- 
nificant facts by starting with the formula 


1 2 A 6 ' fan+2 
(2) Topp tlre terete +t -IFE 


which can be obtained by long division and can be checked by multiplication 
by (1 + £). Integrating this over the interval 0 S ż S x gives the formula 


7 dnt 
(3) tanis =a- S44 S—F4...4 275 - Bw, 
where 
(4) Rae) = f° so ae 
o l+: 
Suppose now that 0 < x < 1. Then 1 S1 + #? S 2 and therefore 
z pint2 x 
(5) J, di S Rala) S f emede 
so 
Oo ot +3 
(6) 2(4n + 2(4n +3) = = R,(x) S aa + 4n + 3° 


This gives us an excellent estimate of R,(x) which implies the more crude estimate 


1 
(7) 0 S Ral) STF 
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This and the sandwich theorem imply that R,(x) — 0 as n — ©, and the defini. 
tion involving (5.622) shows that 


x3 x5 x7 x? x1! 
(8) 


tanix =s-z7z +z -7 +5- t tee 


when 0 < x < 1. Changing the sign of x changes the signs of both sides and 
shows that the formula is correct when —1 < x < 1. Putting x = 1 gives the 
famous Leibniz formula 


1,1 1,1 1, ... 
9) g~1-345-7t5-t > 


which is of particular interest to those who have not previously seen a formula 
from which r could be calculated. Putting x = 1 in (6) shows that we must 
take many terms of the series (8) to obtain a sum agreeing with 2/4 to three or 
four decimal places, and we say that the series ‘‘converges slowly.” The series 
in (8) converges more rapidly when x is nearer0. We are not now in the comput- 
ing business, but it is easy to verify that 


(10) i = tant + tan7! A 
by taking tangents of both sides, and to use (8) to show that r is roughly 3.14. 
The formula of Machin (1680-1751) 


(11) T =4 tant — tan”! en 


which is not so easily proved, is used by professional computers. 

13 Find whether the function f for which f(x) = x + sin x has an inverse 
and, if so, whether the inverse is differentiable. 

14 Let E be an interval of values of x; it could be the interval x > 0, and it 
could be the interval ~1 S x S 1. Let D be the operator or transformer or 
differentiator which, when applied to a function f which is defined and differenti- 
able over E, produces the function ¢@ for which ¢(x) = f'(x) when x is jn E. 
The domain A of Dis then the set of functions f that are defined and differentiable 
over E, and the range R of D is the set of functions ¢ that are defined over E 
and are derivatives of functions differentiable over E. Show that D does not 
possess an inverse. 

15 Let Ebe the interval —1 < x < 1. Let Dy be the restriction of the oper- 
ator D of the preceding problem to the domain A, consisting of functions f 
defined over E for which f(0) = 0 and f’ exists and is continuous over E. Show 
that D, has an inverse. 

16 Let Ebe the interval —1 < x < 1. Let J; be the operator or transformer 
or integrator which, when applied to a function g which is continuous over E£, 
produces the function G for which 


G(x) = f a at 


when x isin E. Describe the domain and range of J; and show that J; has an 
inverse, 
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17 Try tounderstand and even prove the statement that each of the operators 
D, and J, (of the two preceding problems) is the inverse of the other. 


8.4 Integration by trigonometric and other substitutions We 
begin with a statement of the fact that there is an elementary function 
F whose derivative with respect to x is +/a? — xt. Letting F denote 
one such function, we try to learn about F by writing 


(8.41) F(x) = | Va? = x dx 


and searching for an idea. Once upon a time somebody discovered that if 
we substitute x = asin 9, then the integrand will become »/a? — a? sin? 6 
or ~/a? cos? @, and this is a cos 0 if a cos 0 > 0. Thus a trigonometric 
substitution removes the radical and leaves +a cos 6, but we can still 
be unsure of the meaning of f cos 0 dx. Hence we must pause to make 
an observation. 

The chain rule of Theorem 3.65 tells us that if 


(8.42) F'(x) = f(x) 


and if u is a differentiable function whose range lies in the domain of F, 
then 


(8.421) 5 Fuld) = PUOC 
= fOO. 


This gives the following important substitution theorem which shows 
how to replace x by u(t) in integrals. 


Theorem 8.43 If 


(8.431) F(x) = ff(x) dx 
andaif u is a differentiable function whose range lies in the domain of f, then 
(8.432) F(u(t)) = Sf(u(t))u’ (t) dt. 


This theorem is used so often that it is worthwhile to be able to get 
from the right side of (8.431) to the right side of (8.432) in a purely 
formal way without thinking about the way in which the theorem was 
proved and the meanings of the formulas. We replace the old integrand 
f(x) by the new integrand f(u(t))u’(t) and the old dx by the new dt. 
This seems like a simple ritual, but the factor w’(t) might be forgotten 
when problems are being solved. It seems to be safer and easier to think 
of f(x) being replaced by f(u(#)) and dx being replaced by wu’(t) dt. If 
we follow historical precedents, we become carried away by our own 
enthusiasm and try to eliminate all possibility of overlooking the factor 
u'(t) by creating the fiction that dx is a number (a bunch of bananas 
would serve the same purpose) and w’(t) dt is the same thing. There is 
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a sense in which this whole business is utterly silly, but it really is a 
convenience to imagine that dx, u'(t), and dt are three numbers for which 
, . dx 
dx = u'(t) dt. When x = u(t), we can differentiate to get a= u’ (2), 
dx . . 

and the pretense that J is the quotient of two numbers then enables us 
to multiply by dt to get the formula dx = u’(t) dt that tells us to replace 
dx by u’(t) dt. Itis possible to try to say more about these matters but, 
for present purposes, the important thing is that we do not forget the 
factor u’(t). 

Since it is not always convenient to introduce a new symbol, such as 
u, for the function which carries ¢ into x, we restate the theorem in the 
following form which can be ignored by those who prefer the first version. 


Theorem 8.433 If 
(8.434) F(x) = Jf(«) dx 


and if x 15 a differentiable function whose range lies in the domain of f, 
then 


(8.435) Feet) = f FEDO a 
OT 

(8.436) F(x(t)) = j fle oy & a 
OT 

(8.437) F(x) = fte dx. 


We give further attention to changes of variables in integrals by prov- 


ing the following very useful variant of Theorem 8.433 which involves 
Riemann integrals. 


Theorem 8.44 If f is continuous over the interval a S x S b and if u 
is a function which has a continuous derivative and is such that u(t) increases 
from a to b as t increases (or decreases) from a to B, then 


(8.441) fe 1) de = ECOL 


To prove this theorem, let F be a function whose derivative is f so that 


ffx) dx = F(x) + c and 


(8.442) f 1e) & = F(x) |? = F® — Fla). 


The chain formula for derivatives then implies that 


64D fe flu@)w'&) dt = Fuld) | = Fu) — Flula)). 
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The required conclusion (8.441) then follows from the fact that a and 8 
are chosen such that u(a) = a and u(8) = b. Careful attention must 
be given to the manner in which the new limits of integration are deter- 
mined. When we set x = u(t), we must determine a such that x is a 
when ż is a and we must determine 8 such that x is b when ż = £. 

We are now ready to attack (8.41) and, since this is the first time we 
have made a substitution (or changed the variable) in an integral, we 
proceed with great caution. Supposing that a > 0 and that ~a S x S 
a, we set oe 


(8.45) F(x) = | Vat — x? dx 


and let x = asin 6, where —r/2 <0 S7/2. Figure 8.451 always helps 


x 
a? —x? 
Figure 8.451 


us to see what we are doing on occasions like this. Since dx/d@ = a cos @, 
we obtain 


(8.452) F(a sin 6) = f Va? cos? 6a cos 6 dé. 


But cos 6 2 0 when —r/2 S 0 < 1/2, so 


(8.453) F(a sin 6) = a | cos? 6d@ = Sja + cos 20) dé 


a? 
z (8+ sin 26] + c 
a20 + a? sin 0 cos 6] + c. 


I 


Since each x in the range —1 S x < 1 is obtained for a 0 in the interval 
—r/2 < 0 < t/2, we can use Figure 8.451 and the fact that cos 6 2 0 
to return to the original variable x so that 


(8.454) F(x) = $ G sin“ Heva — =| +c. 
This gives the formula 
(8.455) | Va — xdr =$ E sin=! +xVa— J +e. 


We are not always so careful about all of the details, and we do not always 
get correct answers either. There is a reason why we can sometimes be 
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careless about quadrants in which angles lie. There is a “theory of 
analytic functions” that guarantees that, in many situations, a formula 
which is correct when angles lie in the first quadrant will be correct 
wherever we want to use it. 

When we wish to make a substitution to evaluate the integral in 


1 
(8.46) I= J nr 

a/a — x? 
we can clarify our work and save writing the integral by denoting it by 
I, or by Iı or J if we wish to distinguish it from other integrals. We 


can then put x = a sin 6 and allow the variables to shift for themselves 
while we write 


(8.461) r= |- ode = [a0 
a“ COS 


=O+e=sin tite, 


We have evaluated an integral that previously appeared in (8.354). 

The identity 1 — cos? 0 = sin? @ provides the reason why the substi- 
tution x = asin 6 eliminated the square roots from the integrands of the 
above integrals. The identities 


(8.47) sec? 0 — 1 = tan? 6, tan? 0 + 1 = sec? @, 


which are obtainable from the one involving sines and cosines by dividing 
by cos? 6, are less familiar but are nevertheless important when we want 
to use them. Their uses are illustrated below. 

To evaluate the integral in 


(8.471) J= / ary te 


we look at it and generate the idea that we should try setting x = a tan 6. 
Then x’ = a sec? 6, so l 


l 2 1 1 
la sec? p)% SSC 8 do -3| 


x 
— 5“ +e 
a ~/x? + a? ? 
the last step being assisted by a figure showing 6, a, and x in the right 
way. 
To evaluate the integral in 


(8.473) h= | Gaye ts 


(8.472) J 


z | cos ode = sin 8 +e = 
a~“ a 
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we look at it and generate the idea that we should try setting x = asec 0. 
Then x’ = asec @ tan 0, so 


l 
(8.474) Ji = | (a? tan? 6) a sec 6 tan 6 dé 


_ 1 sec 0 1 cos 8 
a | tan?@ a] sin? 6 
l ° —? 1 sin 0)! 
=Z | Gino ? cos dg = =| 26 +c 
1 x 
= a Jaate 


It is not implied that we know in advance that the integrand should be 
written in terms of sines and cosines, but trying this possibility and seeing 
that the power formula can be applied is all a part of the game. 


Our last problem has a lengthy solution. We can bravely start to 
evaluate the integral in 


(8.48) J: = f Vx? — a? dx 


by setting x = a sec 0. Then x’ = a sec @ tan 0 and 


Jo = | Va tan? 0 a sec 0 tan 6 dé 


SO 
(8.481) Jo = af tan? 6 sec 6 dé. 


It happens that this integral is an elementary function, but there is no 
simple direct way of discovering the facts. Authors and teachers, as 
well as students, can work for hours on this problem unless they remember 
how to solve it or (and this much better) have a guiding principle. The 
principle is the following. If we want to learn something about an 
integral and other methods fail to be helpful, we try integration by parts. 
By this we mean that we try to determine two functions x and v of the 
variable of integration in such a way that the integral we are studying 
will be the left member of the formula 


(8.482) fulu (0) dd = u(6)0(6) — Sv(@)u'(6) dé 


and, moreover, the “parts” on the right side enable us to make progress 
with our problem. Assuming that u and v have continuous derivatives 
over intervals where we use the formula, differentiation shows that the 
formula is correct. The trick is to use it effectively. Naturally, we 
want the functions u, v, u’, 2’ to be as simple as possible, and it is quite 
easy to discover that the best way to convert J» into the left member of 


(8.482) is to set 


u(6) = a? tan 9, o'(0) = sec 0 tan 0 
u'(@) = a? sec? 8, v(0) = sec 0. 


(8.483) 
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It is always a good idea to display u and 0’ in one line and u’ andvina 
lower line and to know that the formula (8.482) for integration by parts 
says that the integral of the product of the top two is equal to the product 
of u and v minus the integral of the product of the bottom two.f Thus 


(8.484) Ja = a? sec 0 tan 6 — af sec? 6 dé. 


The last integral seems to be quite as recalcitrant as J2, and this really 
is true because 


a?f sec? 0 d0 = a®f sec? 0 sec 0 d0 = a?f[1 + tan? 6] sec 0 dé 
= a?f sec 0 d0 + a?f tan? 0 sec 6 dO 
= a? log (sec 0 + tan 0) te + Jz- 


Instead of developing faint hearts by throwing everything into a waste- 
basket, we substitute our last result into (8.484) and get 


(8.485) Jeo = a? sec 0 tan 0 — a? log (sec 6 + tan 0) — ¢ — Ja 


If we can now suddenly remember that we are trying to find J2, we can 
transpose the term — Jz (or add J2 to both sides of our equation if trans- 
posing is onerous) and divide by 2 to obtain 


(8.486) Jz = $a? sec 0 tan 0 — $a? log (sec 6 + tan 6) + c, 


where the new constant c, like the old —c/2, can be any constant. With 
the aid of a figure showing how 9, a, and x are related, we obtain 


(8.487) ha [V8 Ed = pva 
+ Vx? a 


x 
— 4a? log ————_—— +c. 
a 
If we wish, we can write the logarithm of the quotient as a difference of 
logarithms and combine the constant part with the c to obtain the formula 


(8.488) Jz = [| Vx? — a dx = $x Vx — a? 
— 4a? log (x + Vx? — a*) +c, 


in which ¢ has another value. The answers in (8.487) and (8.488) look 
different, but they are both correct. The last one may seem simpler, 
but the first one gives c = 0 if J2 = 0 when x = a. 

We should recognize the fact that all or nearly all of the integration 
formulas in the text and problems of this section appear in books of 
tables. We should hear more than once that books of tables are often 
used as labor-saving devices, but that many persons like to derive the 
formulas they use to preserve and improve their mathematical acumen. 


t These matters are very important, and it is required that we think about them more 
than once. Our concentrated attack upon integration by parts appears in Section 9.5. 
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1 By making an appropriate figure and trigonometric substitution, fill in all 
of the missing details which show that 


1 
(a) J Fe Be = | 800 00 = log (œ + VA +o 


1 1 
6) f sapere nz] seco do 


_1l at+x _ 1 a+x 
= 08 aay te = logs t+ (|x| < a) 
(©) fpi =i f coa 
1 x+a x—a 
= — |] ———— eee = — ——_ 
z log mmt 74 8a te (x > a) 


(d) f (a? — x?)% dx = af f cos! 6 d8 
2 When x > QO, the identity 


(1) r¢ la yŻ+l_ tl 


shows that 


1 _ n- l 
(2) J h+ta= fates Hd + | ae 


The first integral on the right is easily evaluated, and the second can be simplified 
by a trigonometric substitution. Use these ideas to derive the formula 


(3) f 1+4 dx = VIF + log x — log (1 + VIF #) + c. 


3 Sketch graphs of y = e and y = log x in a single figure and note that each 
is the mirror image of the other in the line y = x. Observe that the arc Cı on 
the graph of y = e7 which joins two points (1,91) and (f2,¢2) on the graph of 
y = & is congruent to the arc Cz on the graph of y = log x which joins the two 
points (41,1) and (g2,f2) on the graph of y = log x. Letting L denote the length 
of Cı and Ce, show that 


(1) L= [ENTF F et dy = i Ni + 5 dx. 


This gives the formula 


(2) [> JTL di = Í a h 4 4 dee. 


To show that errors and misprints have not led us to an incorrect formula, show 
that the substitution ¢ = log x converts the first integral in (2) into the second 
integral in (2). 
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4 In Problem 2 of Problems 4.39, we called attention to the important non- 
elementary beta integral formula 


Ig! 
(1) f x?(1 — x)? dx TETES 


which is correct when the integral is a Riemann integral and p and g are non- 
negative; we can now report that it is correct when the integral is a Riemann- 
Cauchy integral and p and g exceed —1. Show that putting x = sin? 6, where 
0 < 687/2 when0 Sx S 1, yields the formula 


r lzi 
(2) Í Ë intti @ cos? 0 dô = IGT 
Be sure to observe the fact that the limits of integration are correctly determined; 
setting x = 0 in formulas involving x is equivalent to setting 0 = 0 in formulas 
involving 6, and setting x = 1 in formulas involving x is equivalent to setting 
6 = w/2 in formulas involving 6. Finally, replace 0 by x and let 2p + 1 =a, 
2q¢ + 1 = B to obtain the formula 


(=): (E>): 
(3) fre sin® x cos? x dx = $ ED 
+S 


which is valid when @ and 6 exceed ~1. Remark: If we happen to know that 
(—%)! = Vr, we can conclude from (3) that 


— (~ | 
x/2, _ fr 2 
(4) f sin¢ x dx = i KON 
7)! 
5 Replace x by y? in the first formula of the preceding problem and then 
replace y by x to obtain the formula 


1 eu 
2 +g +1)! 


Tell why the limits of integration are correct. 

6 When we proved the formulas for derivatives of trigonometric functions, 
we were unwilling to base the proofs upon the formula 4 = $r?6 for the area of 
a sector of a circle having central angle 0 and radius r. Supposing that 0 < 
6 < x/2, this problem requires that the formula be proved. With the aid of an 
appropriate figure, show that 


1 
I xPHI(] — x?) dx = 


A= EO dx 


where 
f(x) = x tan 8 (0 <S x < r cos 8) 
f(x) = Vr? — x? (rcos9Sx Sr). 


Then find 4 with the aid of the formula (8.455) which has been proved. 
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7 It is easy to evaluate the integral 
Kı = fx sin x dx 
with the aid of the formula 
fu(x)o"(x) dx = u(x)o(x) — fo(x)u’ (x) dx 


for integration by parts. Do it by filling in the first and then the second row 
of the formulas 


u(x) = v(x) = 


u'(x) = a(x) = 


in the most useful way. Check the result by differentiation. 


8 Do not forget our guiding principle. If you have an idea other than inte- 
gration by parts for evaluating the integral 


Ke = fx? sin x dx, 


write the author a letter. Otherwise, develop a strong heart by solving the 
problem by integration by parts. 


9 When we get good basic ideas we can understand and sometimes even 
originate modifications of them. If Mr. Watson asks us to evaluate the integral 


1 
L= | ——— 
| JAAA 


we could eliminate the radical by setting x = tan? 0, but the result is unlovely. 
We can try to simplify matters by making a substitution x = u(t) so cleverly 
devised that Vx + 1 =ż The denominator will then be simply ¢ + 1 and the 
other details may be of such a nature that we can find L. Show that 


L=2vVx+1—2loøg(vVx+1+1)+c. 


Hint: The simple identity 


enables us to integrate the first member in a hurry. 


. ar 6 . . . 
8.5 Integration by substituting z = tan > This section provides 


useful experience in changing variables by treating a particular substitu- 
tion which is sometimes useful, but the section can be omitted without 
damaging understanding of the remainder of the book. 

We begin by deriving formulas which enable us to use the substitution 


, 0 . . 
(or change of variable) z = tanz to convert integrals of quotients of 


polynomials in sin 0, cos 6, tan 6, cot 0, sec 0, and csc 0 into integrals 


478 Trigonometric functions 


of quotients of polynomials in z. We suppose that —r < 6 < r, so 
that —r/2 < 6/2 < r/2, and define z in terms of 0 by the formula 


(8.51) z= tan x 


Differentiation gives 


dz _ 0 1 TAUR Ski 
Z z gst = 5(1 + tan 7) 7’ 
and hence 
2 
(8.52) dî = EE dz. 
Basic trigonometric identities and (8.51) give 
0 
2 tan >= 
2 2 
(8.531) sin 6 = 2 sin 5 cos 7 = — o = ri? 
ec? = 
2 
0 . 4 O 0 
_ a a 27 
(8.532) cos 0 = cos z ~ sin’ 5 2 cos 7 1 
__2 l = 2 —] = l-z 
g0 -= Ip 1+ 2? 
sec? = 
2 
and, except when @ is 7/2 or —7/2, 
0 
2 tan = 
(8.533) tan 0 = 2 = 22 
6 l-z 
1 — tan? 7 


In fact, Figures 8.541 and 8.542 enable us to write the six trigonometric 
functions of 6/2 and 0 in terms of z. Thus each of sin 0, cos 6, tan 9, 


4 
x 
pe y 2z 
A 
z 
A) - 1—2 
Figure 8.541 Figure 8.542 


cot ĝ, sec 4, csc 0, and d6@/dz is a quotient of polynomials in z. It follows 
from this that if P and Q are polynomials in the six trigonometric func- 


tions of x, then 
P P: 
-= dð = | >d 
J Q J O°” 
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where P, and Qı are polynomials in z. Several examples appear in the 
problems at the end of this section. 


Except in special cases, evaluating integrals of the form 


| aon” t aT H -> F amt + am gy 
boz? + bizm! -+ + ba—1% + bn 


is very tedious business. When n S 2, and in a few special additional 
cases, answers can be found in tables of integrals. Some information 
about the matter will appear in Section 9.4 of this book. 


Problems 8.59 


1 By use of the substitution z = tan 5, show that 


— 


1 1 6 
(a) [ pata | ea log |z| + ¢ = log | tan5|+¢ 
l _ _ Jr-z+1+2z, 
(2) | ape | ie = | apa 
6 
1 + tans 
1 1 2 
= | (Gite alte 
l — tanz 
1 2 
© [am - | ees 
ee ee ee 
@ | ym” ~ J (a — c)z? + 2bz + (a+c) z 


sin 8 + cos 8 z(1 — 2) (1 + 22 — 2°) 
o Sta] (1 + 27)4 dz 


2 , 1— 2 
D reptensmaten feed 2] Tra” 
(g) log HA bes logsecO+e= f tanode=4f rEg 4 dz 


(h) (et rea) Femme eo es 


=f bcos 
TF b sinô 
1 — 2? 


= 2b | (az? + 2bz + a)(l + 5 


sin 0 4g, 
©) Berak g 79 = J G+tait+a” 


cos 6 1 — 2? 
D [me a= | ae j 
— 1 — z2 6 
= 21? os = Dtant (tang) — tang +e 


9 Exponential and 


logarithmic functions 


9.1 Exponentials and logarithms At least a modicum of basic 
information about exponentials is possessed by everyone who looks at 
Chapter 9 of a calculus book. Everyone knows that 34 = 3-3-3-3 and 
3% = 4/3. Such formulas as 


a*t = Vas = (/a)5, 


in which it is supposed that a > 0, are familiar. Remark 20 which 
appears among the problems at the end of this section provides basic 
theory of a* for the case in which a > O and the exponent x is a rational 
number, that is, x = P/Q, where P and Q are integers for which Q = 0. 
There is a reason why this theory is often neglected. It is too difficult 
for elementary algebra books because it requires use of the intermediate 
value theorem and hence requires knowledge of completeness of the real- 
number system. It is too simple for calculus books because it is mostly 


elementary algebra. The mathematical theory of exponents, like the 
480 
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physical theory of hydrogen, is neither so brief nor unimportant that it 
is unworthy of several hours of study. 

We start here with a given number a, the base of our exponential func- 
tion a”, for which a > 1. We suppose that a? has been defined for rational 
numbers x and that, if x and y are rational numbers for which x = P/Q, 
then 


(9.11) aa! = arty, (a)¥ = qr, a? = (aP) te = (a!!Q)P. 


For the purpose of learning about mathematics in general and about 
the exponential function a? in particular, we 


make a direct frontal attack upon the problem y 
of learning more about a? when x is rational. ° 
We then use this information as a basis for def- y=a* 
inition and study of a? when x is real. Our “rational o 


first step is to calculate a” for several values of 
x and to start making a graph of y = a? as in 
Figure 9.12. We mark only the points whose 
coordinates we have calculated and overcome 
the primitive urge to draw a “smooth curve” 
through these points. We have experimental 
evidence that 0 < a? < a*** when x and h are 
rational and h > 0, and this matter should 
be investigated. When x = P/Q, a'Ì is the 
positive number 7 for which r? =a and Figure 9.12 
az = (al@)P = r? > 0, soa? > 0. Itturns out 

that we can obtain a huge amount of information from the identity 


(9.13) atè — gt = (aè — l)a. 


For a modest beginning we suppose that h = p/q, where p and g are 
positive integers, and show that a > 1. When f(x) = «%, we see that 
f(1) = 1 and that f(x) is continuous and increasing when x > 0. Since 
a> 1, the one and only number x, for which f(xı) = a must exceed 1. 
Thus x? = aand a/¢ = x, where xı > 1, and hence aè = x? > 1. Thus, 
when h > 0, the right member of (9.13) is the product of two positive 
numbers and hence is positive. This implies the following theorem. 
Theorem 9.14 If x and h are rational and h > O, then 


(9.141) 0 < at < att. 


This means that, as a function of the “rational variable” x, the function 
having values a? is positive and increasing. Proof of this fundamental 
fact represents a triumph of mind over matter, but one triumph is not 
enough and we must continue. Tables and slide rules and brute-force 
calculations with pencils produce experimental evidence in support of the 
following theorem. 
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Theorem 9.15 As a function of the “rational variable” x, the function 
having values a? is continuous over —œ© <x < ©. 
We shall prove this theorem with the aid of (9.13) and an estimate of 
the troublesome factor (aè — 1) that is, from our present point of view 
and from some others, quite amazing. 
pa P i p Let h be a positive rational number for 
0 1 * which h S 1 and let n be the greatest 
Figure 9.151 integer for which nh < 1. If we wish, 
we can draw Figure 9.151 and put a 
part of the graph of y = a?” over it to help us see what we are doing. The 
equality 


(9.152) (aè — 1)a@-DA = gkh — glk-Dh 


holds when & = 1,2,3, - - - , n and summing over these values of k gives 
the formula 


(9.153) (a — 1) LÈ ald | = $ (ath — g(k—Dh) 


ka 
(aè — a°) + (aè — at) + (a — a4 -+ e + (gr — aD) 


= a" — a? = grhkh—_ ] <a—l 


which has, in its middle line, a sum which is called a telescopic sum because 
it “telescopes” to a™* — a°. More critical examination of the sum in 
brackets will appear later. Meanwhile, we observe that if we denote 
the sum by S, then each term in the sum is 1 or more, so S 2 n. But 
nh > $, son>1/2h and S > 1/2h. Replacing S in (9.153) by the 
smaller number 1/2h gives the inequality 


(a —-1) $ <a—1. 


2h 
Therefore, 
(9.154) aè — 1 < 2(a — I)h 0 <khkz<al). 
This and (9.13) show that the formula 
(9.155) ath — gt S 2(a — 1)ha” 


is valid when x and h are rational and O < h < 1. This result and the 
fact that a” is increasing give the following theorem. 
Theorem 9.16 If r, and rz are rational numbers for which 
É I2- nl =AS1 
and if M is a rational number for which rı S M and re < M, then 


|a” — a"| S 2(a — 1)|re — riļa™. 


This is a stronger version of Theorem 9.15 which implies Theorem 9.15. 
Our information about the values of a” for rational numbers r is now 
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very substantial, and we proceed to use it to define and study a* when x 
is real. Let x be a real number which is not necessarily rational. In 


case x is rational, the fact that a’ is an increasing function of the rational 
variable r implies that 


(9.17) a? = |.u.b. a’, 


rsz 


where the right side is the least upper bound of the set of numbers a’ 
for which 7 is rational and r < x. In case x is irrational, we define the 
number a® by this same formula (9.17). To use this definition, we need 
only a very little information about least upper bounds of sets of num- 
bers. We need only the elementary fact that if each number in a set Sı 
is also a number in a set Se, then the least upper bound of the first set 
must be less than or equal to the least upper bound of the second set. 
Suppose xı and xz are given real numbers for which xı < x2 and we choose 
two rational numbers 7; and r2 such that 


Hy L Tı << fe < Ko. 
Then, since a” < a”, 


a®! = l.u.b. a’? < l.u.b. a? = a" < a” = l.u.b. a < Lu.b. a” = a”, 
ríz rsri T ST? Tíz 


and this proves that a7 is an increasing function of x. If 0 < ô< M, 
we can choose a positive integer n such that a* > M and a™ < 6. 
This and the fact that a” is positive and increasing imply that a7 — 0 
as x— — œ and af— © as x— œ. To prove that a® is continuous 
(and in fact uniformly continuous) over the interval x < M, we can 
employ Theorem 9.16. Let e> 0. We can then choose a 6 > 0 such 
that la — a| <e whenever rı < M, r: < M, and lro — ri| < ô. 
Whenever xı and xz are real numbers for which xı < M, x2 < M, and 
|x2 — xı| < 6, we can sandwich xı and xz between rational numbers 
rı and ro for which rı < M, re < M, and |rz — rı| < ô. The fact that 
a7 is increasing then implies that |a” — a”| < ja" — a™| < e. 

To complete our study of the basic theory of a* when x is real, we must 
prove the formulas 


(9.18) ata! = att, (a*)¥ = a™, 


Since a? is continuous, we can let 71, fa * © * and 51, 52, © * © be sequences 
of rational numbers converging to x and y so that 
ata¥ = (lim at*)(lim a’) = lim (a*a) 
N> N— © n— © 
= lim arrten = qzuty 


n—> 0 


and the first formula is proved. To prove the second formula, we vary 
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the procedure by letting r, s, t, and u be rational numbers for which r < 
xSsandisysu. Then 


axa < a and (ar)! < (a7)! < (a*)*, 
SO 
art < (aF)! < az, 


Taking limits as rt — xy and su — xy gives a™ < (a*)¥ S a%™ and shows 
that (a7)! = a”. 

Our basic theory of exponents enables us to give the basic theory of 
logarithms in a few lines. With the aid of the intermediate-value 
theorem, we see that to each positive number x there corresponds exactly 
one number y such that a = x. This number y, an exponent, is called 
the logarithm with base a of x and is denoted by loga x, so that the formulas 


(9.181) a! = x, y = loga x, agot = x 


are equivalent. The logarithmic function is the inverse of the expo- 


Figure 9.182 


nential function, and its graph is shown in Figure 9.182. The funda- 
mental formulas 


(9.183) logs xy = loga x + log, y, loge x” = y loga x 
are proved by setting u = log, x, v = loge y, x = a", and y = a’, so that 


xy = a*a? = qt 
and 


loga xy = u +v = loge x + loga Y. 
Moreover, x” = (a*)¥ = a", so 
loge x” = uy = y log, x. 


When a, b, and x are positive numbers for which a > 1 and b > 1, 
we can equate the logarithms with base b of the members of the identity 
x = at? to obtain the first of the formulas 


(9.184) log, x = (loga x) (log, a), 1 = (loga b) (log, a), 
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and putting x = b in the first gives the second. Putting b = eand a = 10 
gives the formula 


(9.185) log « = (logy x)(log 10) = logio x 
lo 10 £ 
which, together with the estimates 


(9.186) log 10 = 2.30258 50929 94045 68402 
(9.187) login € = 0.43429 44819 03251 82765, 


enables us to find log x with the aid of a table of values of logio x. These 
formulas are shunned when a log-log slide rule is available and gives 
satisfactory accuracy and when a Satisfactory table of values of log x 
is available. When decimal representations of numbers are used in cal- 


culations, it is often necessary to know that each positive number y is 
representable in the form 


(9.188) y = 10", 
where n is an integer and 1 S x < 10, and that 
(9.189) loga y = n loga 10 + loga x. 


This formula enables us to find loga y with the aid of a table giving values 
of loga x when 1 < x < 10. The formula works whenever a > 1. Itis 
simplest when a = 10 and log, 10 = 1; in this case n loga 10 is an integer 
(the characteristic of the logarithm) and rules for finding it are sometimes 
peddled without revelation of the fact that they are identical with the 
rules for finding the exponents in the representations 


416.3 = 10? 4.16 
0.00004163 = 10-5 4.16 
4.163 = 10° 4.163. 


Our derivations of formulas for derivatives and integrals of logarithms 
and exponentials will come in the next section. Meanwhile, we close 
this introductory section with some historical remarks. The first pub- 
lished table of logarithms, the “Mirifici Logarithmorum Canonis - °° ,” 
by John Napier (1550-1617), appeared in 1614. The rare-book collection 
of the University of Illinois contains this book and an astonishing col- 
lection of old tables. To indicate that these books and their titles are 


nontrivial, we cite the full title of the 1631 edition of the book of Napier.t 


t Napier, John, ‘Logarithmicall Arithmetike, or tables of logarithmes for absolute 
numbers from an unite to 100000; as also for sines, tangentes and secantes for every 
minute of a quadrant: with a plaine description of their use in arithmetike, geometrie, 
geographie, astronomie, navigation, etc. These numbers were first invented by the most 
excellent Iohn Neper, Baron of Marchiston, and the same were transformed, and the 
foundation and use of them illustrated with his approbation by Henry Briggs, Sir Henry 
Savils Professor of geometrie in the Universite of Oxford. The uses whereof were written 
in Latin by the author himselfe, and since his death published in English by diverse of his 
friends according to his mind, for the benefit of such as understand not the Latin tongue,” 


London, 1631, 819 pp. 
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For more than 300 years, logarithms with base 10 were systematically 
and extensively used to make arithmetical calculations. Respectable 
scientists of the present and future must know about them, but sub- 
stantially all of the chores formerly done with the aid of tables of logis x 
are now done with slide rules and mechanical and electronic calculators 
and computers of assorted shapes and sizes. It seems that the first 
published table of logarithms with base e appeared in a 1618 edition of 
the Napier tables, and that John Speidell used e as a base of exponentials 
in a book published in 1620. Much of the present usefulness of e is 
based upon work of Euler (1707-1783). Except for collectors of rare 
books, tables of e7, e-*, and log x are now vastly more valuable than 
tables of logio x. 


Problems 9.19 


I Find the values of a, x, and y that satisfy the equations 


(a) 27 = 32, a = 32,25 = y 

(b) x = log: 32, 5 = loga 32, 5 = loge y 

(c) 74 = y, 77 = 2401, a* = 2401 

(d) 4 = log; y, x = log; 2401, 4 = loga 2401 
(e) aè = 1000, 10? = y, 107 = 1000 

(f) 3 = log, 1000, 3 = logio y, x = logio 1000 


2 Practice the art of starting with the first of the equations 
y = af, log y = x log a = (log a)x, y = eke, 


taking logarithms (with base ¢) to obtain the second equation, and then using the 
definition of logarithms to obtain the third equation where k = log a. 
3 Practice the art of starting with the formula y = a? and writing 


y= az = (e*)= = ef, 


where k is the exponent which we must put upon e to get a and hence k = log a. 
4 Using the method of Problem 2 or the method of Problem 3, show that 


(a) x7 = etloez (x > 0) 
1 log (1-+2) — log 1 
~ log (1+ — 
E) (+a = eo Et Oe (œ > —1) 


(c) f(x = ole) logs) (f(x) > 0) 


5 Show that ah? = ez when ke = kı log a. 

6 Sketch graphs of the equations y = 27 and y = loge x on the same sheet 
of graph paper. Tell why the line having the equation y = x is (or is not) a 
line of symmetry of the set consisting of the two graphs. Sketch a line which 
appears to be tangent to the graph of y = 2? at the point (0,1), estimate the 
coordinates of the point where this line meets the line having the equation y = x, 
and use the results to obtain an estimate of the slope of the tangent. Finally, 
modify the procedure to obtain an estimate of the slope of the line tangent to 
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the graph of y = logs x at the point (1,0). Remark: This is not a dull problem, 
because moderately careful work produces very good results. To check one of 
the results, let f(x) = 27 so f(x) = e782 so f'(x) = et182(log 2) and f’(0) = 
log 2 = 0.693 cot, 

7 Apply the procedure of Problem 6 to the equations y = 37 and y = log; x. 
Try to find a way to use a table or slide rule to check the numerical results. 

8 Many persons with scientific interests have (and should have) log-log 
slide rules and should know or learn how to set them to obtain values of a? when 
2S aS3and —1 Sx#81. These persons can quickly produce a good graph 
of y = (2.5)? over the interval —1 S x S 1 and use it to obtain a good estimate 
of the slope at the point (0,1). Repeating the process for y = (2.75)? and using 
ideas involving interpolation lead to quite good approximations to a base ¢ for 
which the graph of y = e7 has slope 1 at the point (0,1). This base ¢ is the famous 


1 1 
9 Tell why f 22 dx and I, 37 dx exist. Then sketch appropriate graphs 


and use them to obtain rough estimates of the values of these integrals. Finally, 
check the estimates with the aid of the formula 


1 1 1 1 
j) a? dx = J e? log a dx = ez log | 
0 0 log a 0 


1 a— 1 
~ To a (ei “Y= log a 


10 Tell why Í, l 1007 dx could not be 0.05 and could not be 500. 


11 Discover reasons why we could suspect that 
N at dt < —5 l =e £ 


when a > 1 and x > 0. 
12 Sketch graphs of the equations 


(a) y = 2-4" (b) y = 277 


Remark: More detailed information about the graph of the equation y = 6717 


will appear later. 
13 In Problem 4 of Section 12.6, we shall discover that if z > 0, then 


(1) log z! = 3 log 2r + (z + 5) log z — z + E(2), 
where E(z) is a number for which 

1 1 1 1 1 
(2) Ta ~ 3008 < EO < Ty, ~ 3608 T 120 


Show that (1) can be put in the form 
(3) log z! = log V 22m + log #7 — z + E(z) 
and hence that 


(4) gi = ~/ 2am zee, 
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Remark: The formulas (1) and (3) and (4) are examples of Stirling formulas for 
log z! and ai. 
14 This problem requires that we capture the ideas used to obtain the number 


y = 20-41631 690216 


correct to 10 decimal places, it being assumed that a table giving e416 correct 
to 15 places and a desk calculator are available. The first step is to recognize 
that y = AB, where A = e7, B= eh, x = 0.4163, and h = 0.00001 690216. 
The number 4 is copied from the table correct to 12 or 15 places. The number 
B is readily calculated to 12 or 15 places with the aid of the formula 


h he ht 
@=ltht+az+etato 


which we shall learn about in a problem at the end of the next section. After 
a few buttons have been pressed, the calculator will give the product y = 4B 
in a hurry. 

15 This problem requires that we capture the ideas used to obtain the number 


z = log 4.16316 90216 


correct to 10 decimal places, it being assumed that a table giving log 4.1631 cor- 
rect to 15 places and a desk calculator are available. The first step is to recognize 
that 

4.16316 90216 


z = log 4.1631 + log ——q 7637 


and hence that z = 4 + B, where 4 = log x, B = log (1 + A), x = 4.1613, and 


z= 0.00006 90216 
~ 41631 ` 


The number 4 is copied from the table correct to 12 or 15 places. The number B 
is readily calculated to 12 or 15 places with the aid of the formula 


h h hd 
lg (l+h=h-Z+yz- | 


The sum 4 + B can be obtained with a pencil, but it is safer to use the calculator. 
16 Supposing that a desk calculator is available to perform additions, sub- 
tractions, multiplications, and divisions, describe the steps by which the numbers 


(a) 27, @) a, (ey, (D, e) Va 


can be calculated correct to 12 decimal places. Hint: Take logarithms with 
base e so that tables of values of e” and log x can be used. Partial ans.: Letw = q" 
and obtain the formula log w = C, where C = AB, A =r, and B = logr. 
Find a table giving correct to 15D (that is, 15 decimal places). Then use the 
method of Problem 15 to find B correct to 15D. Then multiply to find C. 
Then w = ¢. Let C = n + x, where n is an integer and 0 £ x < 1, so w = 
e*e*, Find æ by multiplication or from a table. Find e7 by the method of 
Problem 14. Finally, multiply to get w. 


9.1 Exponentials and logarithms 489 


17 Persons who start fires with matches can be interested in the fact that 
fires can be started by rubbing sticks together. We shall soon learn modern 
ways of calculating approximations to exponentials and logarithms, but we can 
be interested in seeing that more primitive methods will work. Supposing that 
a and x are positive numbers given in decimal form, and that 0 < x <1, we 
outline an old, old procedure for finding decimal approximations to a? that was 
used when all computations were made by hand and when letters A,B,C, --- 
of the alphabet were used in place of xo, #1, xa °° >. To get started, let yọ = 
0, zo = 1, and observe that the inequalities 


(1) Vn í xs Zn, aun <= qt S ain 


hold when n = 0. Supposing that n = 1, calculate xı and then a7! from the 
formulas 


(2) Xn = Zal T Fant + Žel 


En = 4 / Unan 
3 afr = q aun 1 až» 1, 


and let Yn and Zn be two of the three numbers Ya—1) Xn, Zn-1 Chosen in such a way 
that the inequalities (1) hold when n = 1. Repeating the process with n = 2 
gives (2) and then (1) when n = 2, and the process can be continued. Since 
Zn = Yn + 1/2", we can be sure that our estimates of a? will be good when is 


large. To find i F 
(V2 + v3 


correct to 32 decimal places by this method (or by any other method) would be 
quite a chore for an inexperienced person, but persons who run modern computers 
can enjoy making such computations. Primitive methods often work better 
than fancier methods of limited applicability. 

18 Persons possessing slide rules may wish to study them while reading 
something that tells why they work. The C and D scales contain numbers from 
1 to 10 and, when it is supposed that these scales have unit length (this unit is 
usually about 10 inches), the number x lies logio x units from the left end. To 
find the product X of two positive numbers 4 and B, we put them in the forms 
A = 107a and B = 10% where m and n are integers andl S a < 10,1 <b < 10 
and use the fact that X = 107x where x = ab. Noting that 


(1) logio x = logio a + logio b, 


we run along the D scale a distance logio a to the number a and then, after setting 
1 on the C scale opposite a, run along the C scale a distance logio b to the number b. 
We have then gone the distance logio x from the left end of the D scale and hence 
can read x on the D scale. 

19 Persons possessing log-log slide rules may wish to see why and how the 
esoteric LL scales are made. Suppose we wish to find the number y for which 
y = b1 when b and g are given. We note that 


(1) log, y = q log, b 
and 


(2) logio loge y = logio loge b + logio g. 
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The LL3 scale contains numbers from ¢ to about 22,000. The distance from the 
left end of the LL3 scale to a number y is logio log. y. If we run a distance logy, 
loge b on the LL3 scale to hit b and then, with the aid of the C scale, run an addi- 
tional distance logio q, we go a total distance logi log. y and hence can read y on 
the LL3 scale. We can be comforted by discovery that log-log slide rules give 
the simple formulas 34 = 81 and 42-5 = 32 as well as others that are not so easily 
verified. It is often particularly useful to know that if points b and f lie opposite 
each other on the LL3 and D scales, then 


(3) logio loge b = logio f 


so loge b = f and b = ef. Thus natural exponentials and logarithms are easily 
read, and the good approximations ¢? = 20, log 20 = 3 are always available to 
show us which way to read the scales. With the aid of this and similar informa- 
tion about other scales, it is possible to both understand and use slide rules. 

20 This is the promised development of the theory of a? for the case in which 
x is rational. Let a be a positive number. It is the base of the exponential 
function a? which we are about to define when x is an integer, then when x is the 
reciprocal of a nonzero integer, and then when x is a rational number which is not 
necessarily an integer. If n is a nonnegative integer, we define a” and a™ by 
the formulas i 


—— . a gane > o @ SB m 
(1) a” = l-a-a-a-a a, a zaa.. -ù 


? 

where in each case 1 is multiplied by a exactly n times. If n = 0, no multiplica- 
tions are involved and a? = 1. Clearly, a! = a, a! = 1/a, a = aa, a? = 
1/a-a, etcetera, and a™ = 1/a” or aa" = a? = 1. Counting numbers of times 
by which 1 is multiplied or divided by a enables us to show that the laws of 
exponents 


(2) atat = arty, (a*)¥ = a7” 


hold whenever x and y are nonnegative integers and then whenever x and y are 
integers. Let n be a nonzero integer and let f(x) = x” when x >0. Then 
f'(x) = nx”, so f is continuous and increasing when n > 0 and is continuous and 
decreasing when n < 0. Moreover, f(x) > 0 as x > 0 and f(x) > œ as x — œ 
when n > 0, and f(x) — œ as x— 0 and f(x) ~ 0 as x— © whenn <0. In 
each case these facts and the intermediate-value theorem 5.48 imply that there 
is exactly one positive number Å for which 4” = a. We then define a!” by the 
first of the formulas 


(3) gin = h, a= h’, (ail/n)n = a, (a”) Un = a, 


remember that the first formula is equivalent to the second, and observe that the 
last two formulas are correct. Supposing that A, p, and g are integers for which 
Aq Æ 0, we acquire ability to manipulate these things by proving the formula 


(4) (ad) 1a = (a?) te = (a!la)? = (alMa)>p, 


To prove this, let H be defined by a!ı = H, so that a = H~ = (Hò) and 
ava = H>. Thus (4) will be true if 


(5) ((H*)\7) a = ((H)P) a = (HY)? = H™, 
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But (Hs)? = (7) and (H^)? = (H>?)s, and it follows that each member of 
(5) is Hò. Thus (5) is true and this proves (4). Just as the rational number r 
defined by r = —$# is representable in the forms 


63 _ @(-9) _ -9 


“<8 7 

so also each rational number r given in the form r = P/Q, where P and Q are 
integers, is uniquely representable in the form 2/4, where P = Xp, Q = M, and 
A, p, and q are integers for which g > 0 and the integers |p| and q are relatively 
prime, that is, have no common positive integer factor different from 1. This 


fact and (4) show that when P and Q are integers for which Q =Æ 0, we can define 
aP/@ by the first of the formulas 


(6) aPIQ = (aP) = (qle)P 


with assurance that the whole formula is correct and that the formula remains 
correct when we multiply or divide both P and Q by the same integer à provided 
A Æ 0 when we divide. This completes the definition of a* when x is rational. 
To prove that the laws (2) hold when x and y are rational, we can let x = p/n 
and y = g/n, where 9, q, n are integers and n > 0, to obtain 


atal = (arin) (aat) = (al/n)p(giln)a = (ain) pta = giptg)in — aztu 
and 
((a*)¥)”? = (((aP/”)@) tn) n? = (((a?)1n)¢)n = (((a?)2)ln)n = gP¢ 
sO 
(a*)¥ = (apa) tn? = arm? = ary. 


9.2 Derivatives and integrals of exponentials and logarithms 
Let a > 1. Looking forward to a derivation of a formula for the deriva- 
tive of the function f for which f(x) = a*, we use h instead of Ax, write 


(9.21) ath — gt = (aè — 1)q?, 
and observe that 

d ._, İċ a |, a—1] , 
(9.22) Tn? = tim ZTE = fiim 7 |e 


provided the limits exist. If we succeed in proving that, for some con- 
stant 4 which may depend upon a, the first of the formulas 


h — ho 
(9.221) im 22124 im l -=4 


is valid, then the second will also be valid because 


h>0— A h04 —A 0+ A h—0+ h 
and we will be able to conclude that 
d 


(9.222) 5a Ae. 
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To investigate the factor aè — 1, we employ the method which we used to 
prove Theorem 9.15 when we knew practically nothing about exponentials. 
Let h be a number, now not necessarily rational, for whichO < h < land 
let n be the greatest integer for which nh $1. The equality 


(9.23) (aè — 1)a®-Dh = gkh — gh-Dh 


still holds when & = 1, 2, 3, - - + , n, and summing over these values of 


k gives, as in (9.153), 


(9.231) (a* — 1) [ » gph | = grh — 1 
k=l 
and hence r] 
à — 1 a*-— 
(9.232) 7 = < 
ak-Dhh 
k=1 


The numerator of the right side converges toa — 1 as h— 0. If to the 
denominator of the right side we add the negligible term a™*(1 — nh), 
the new sum will be a Riemann sum, with partition points 


O<h<@*Aah<3h<--- <nh<l 


in the interval O S$ x < 1, which converges to the positive number 

1 oe . 
f a* dx as h, the norm of the partition, approaches 0. Because it is 
sometimes extremely helpful to be able to recognize Riemann sums when 
they appear in somewhat disguised forms, we give careful attention to 
the details. If we set f(x) = a7, set x, = kh when OS k SX n, set 
Xng1 = l, setxf = x,whenl Sk < n +1, set Ax, = hwhenl Sk Sn, 
and set Ax,+4ı = 1 — nh, then the new sum becomes the Riemann sum 


n+i 
(9.233) » fla®) Axe 


and as h, the norm of the partition, approaches 0 the Riemann sum 
1 
approaches h f(x) dx, that is h a* dx. Therefore, 


h — 
(9.24) im = 471 Ly 
mo A Í a* dx 
0 


where 4 is the constant defined by the last equality. It is possible to 
squeeze information from this formula, but we obtain a formula giving 
a simpler relation between a and JA before actually showing that there is 
one and only one number a for which 4 = 1. 

We have obtained the first of the formulas 


d T == T d — 1 
(9.25) ie = Aa, Flogs = >. 
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Since a* has a positive derivative, Theorem 8.33 implies that its inverse 
loga x, defined over the infinite interval x > 0, is differentiable. Hence 
we can differentiate the first of the formulas 


(9.251) qaz = x, Aas z logax = 1 


with the aid of the chain rule to obtain the second and hence obtain the 
second formula in (9.25). Conversely, if the second formula in (9.25) 
is known to hold, then differentiation of the members of the formula 
loga a” = x gives the first formula in (9.25). Since loga 1 = 0, replacing 
x by ż in the second of the equivalent formulas (9.25) and integrating 
over the interval from 1 to x gives the formula 


(9.252) A loga x = | 1 dt. 
1 


Since loga a = 1, putting x = a gives the formula 
(9.253) A= [te 
1 


which clarifies the relation between a and JA. 

Exponential functions and their derivatives and integrals occur so 
often that the constant 4 in the above formulas would, if allowed to 
survive, be an insufferable nuisance. Hence we can relish the proof that 
we can choose a to make 4 = 1. If we let f(x) denote the right member 
of (9.252), then f is continuous and increasing over the whole interval 


x > 0, f(1) = 0, and 
4 
1 
(9.254) fa) = | ta >b+h+Er 
1 


There is therefore exactly one positive number e for which 1 < e < 4 
and the constant 4 in (9.253) and preceding formulas is 1 when a = e. 
Thus all of the above formulas are correct when a = e and 4 = 1. 
In particular, we have proofs of the basic formulas 


d enge 2 _i 
(9.26) mom z 108 * = 5 


and hence also of their companions 
(9.261) fe dx =e+ 0, [5% = lgx +e. 
Setting a = e and A = 1 in (9.24) and (9.253) gives the formulas 


(9.262) hp ra=e-1 [yeah 
0 1% 
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which we can now obtain very easily 
by evaluating the integrals. Each 
of these formulas actually deter- 
mines ¢, and the second one says 
that e is so determined that the area 
A of the region shaded in Figure 
9.263 is 1. Many other formulas 
Figure 9.263 involving e will appear later, and one 

of them will enable us to calculate ¢ 
correct to 5 or 10 decimal places with surprisingly little effort. Mean- 
while, we can remark that many persons remember the 16D (16-decimal, 
but only 15 after the decimal point) approximation to e by mentally 
grouping digits in the form 


(9.264) e = 2.7 1828 1828 45 90 45 


so we can visualize the repeated 1828 followed by 45 and twice 45 
and 45. 

The number e is the natural base of exponentials and logarithms. 
Other bases sometimes appear. ‘The base 10 is used when we give a 
number n and say that “there are 10" atoms in the universe,” but not 
even Eddington suggested that this should be differentiated with respect 
ton. We never differentiate or integrate a*'* unless ais e. If, as never 
or rarely happens outside misguided examinations in calculus, we are 
called upon to differentiate or integrate a** where a ¥ e, we write 


(9.265) ghz = pha 


and take logarithms with base e to obtain kıx log a = kx and hence 
ko = kı log a. We then work with e* instead of a. 

Our theory of exponentials and logarithms enables us to provide the 
promised proof of the power formula which, for convenience of reference, 
we put in a theorem. 

Theorem 9.27 If n is a constant, integer or not, then 


dx 
when x > Q. 

To prove this theorem, let y = x” and take logarithms to obtain 
log y = n log x. Since log y is differentiable, we can, because y = e% +, 
conclude that y itself is differentiable and can use the chain rule to obtain 
(1/y)dy/dx = n/x so dy/dx = nx. This proves Theorem 9.27. We 
are now able to prove the last of the basic limit theorems, Theorem 3.288, 
which we now restate. 
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Theorem 9.271 If pis a constant positive exponent, then the first of 
the formulas 


(9.272) lim x? = a’, lim x? = a? 
ad t—a+ 
holds when a > O and the second holds when a = 0. 

To prove this theorem, let p be a given positive constant and let f 
be the function for which f(x) = x? when x 2 0. That f is continuous 
at a when a > Q is a consequence of the fact (see Theorem 9.27) that 
f'(a) exists when a > 0. This proves that the formulas (9.272) are valid 
when a > 0. Since 0? = 0, it remains to be proved that 


(9.273) lim x? = 0. 
20+ 
With the aid of the facts that 
(9.274) lim log x = — œ, lim ¢ = 0, 
x—0+ h= ~ œ 


this follows from the equality 
(9.275) x? = erlar, 


We conclude with some remarks that lead to a formula which has not 
appeared in our work but which nevertheless has much more than his- 
torical interest. If we know the basic properties of logarithms, we can 
let L be the function for which L(x) = log x when x > 0 and obtain the 
formula 


(9.28) Le tH E) _ log (x +h) —logx 1 h 


= — ~ = 7 log I+ 


1x h l h\zlh 


which is of interest from more than one point of view. If we can show, 
without reference to our previous results, that there is a number e for 
which 
h\=alh 
(9.281) lim (1 + z) = e, 
h-0 x 

then we can adopt this new number e for the base of our logarithms and 
use (9.281) and the fact that L is continuous and L(e) = 1 to obtain 
another proof of the formula L’(x) = 1/x. This formula then provides 
a back-handed proof of the formula de*/dx = e*. The difficulty in this 
view of (9.28) lies in the fact that “direct” proofs of existence of the 
limit in (9.281) are neither brief nor quickly comprehended. A proof 
is given at the end of the problems of this section. We now look again 
at (9.28) and realize that we have proved that L’(x) = 1/x and hence 
that the members of (9.28) must converge to 1/x as h— 0. Since L 
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is a continuous increasing function for which L(e) = 1, the formula 
(9.281) must be correct. Thus we have an “indirect” proof of (9.281). 


Replacing x and h by their reciprocals shows that (9.281) implies the 
more interesting formulas 


. xN . x \? 
(9.282) lim (1 + 3 = e, lim (1 + z) = g7, 
[h h {hl h 
The most famous of these formulas, obtained by setting x = 1 and con- 
sidering Å to be a positive integer, is 


(9.283) lim (1 + x) =e. 

n> æ n 
Since 1 + 1/n > 1, the result in (9.283) furnishes a very interesting 
compromise between the exponent n whose growth tends to make the 
quantity large and the term 1/n whose decrease tends to make the 
quantity near 1. 


Problems 9.29 


1 When we know some calculus, we can quickly dispose of problems that 
were troublesome. Show that the functions having values e7” and log x are 
increasing. 

2 With the aid of information yielded by first and second derivatives, sketch 
a graph of y = e? and the tangent to the graph at the point (0,1). 

3 With the aid of information yielded by first and second derivatives, sketch 
a graph of y = log x and the tangent to the graph at the point (1,0). 

et y = xe-*. Show that - 
£ Lety eA = -£ 


yE) = (1 ~ wee. ae 
Show that y’(1) = 0, y’(x) > 0 when x < 1, and y'(x) < 0 when x > 1. Show 
that 
y” (x) = (x — 2)e7*. 
Determine when the slope of the graph is increasing and when it is decreasing. 
Use your information to sketch the graph. 
5 Show that 


d 
(a) T T? = xe (b) z geinz = eña? cog x 


d 
(O fn = te (D) ee = abertas 
6 Show that 
d 2logx , d 
(a) zg; (log x)? = ——3= (4) 7, log log x = Toes 
d . d 2x 
(c) a log sin x = cot x (d) = log (1 + x°) = IF 


7 Textbooks in differential equations provide substantial information about 
some of the situations in which a “population” or something else depends upon 
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t (time) in such a way that its rate of change with respect to ¢ is proportional to 
it. There are in fact many situations in which it is assumed that y is a function 
of z such that, for some constant &, 


(1) am 


at each time z. We have thought about this matter before, and we shall think 
about it again. The pedestrian way to start to extract information from (1) 
is to divide by y. Without assuming that y Æ 0, show (with complete attention 
to each detail) that if (1) is true, then we can transpose ky and muliply by e~# 
to obtain the first of the formulas 


d 
(2) dt ehy = Q, ekty = A, y = Aek 


and then the remaining ones in which Æ is a constant. Continue the work to 
show that if y satisfies (1) and the boundary condition 


(3) y = yo when ż = Q, 
then 
(4) y = yoe*. 


Remark: This problem provides a major reason why exponential functions are 
important. In many cases, y decreases with passage of time and & is negative. 
In such cases there is a positive number T such that 


(5) yoe! = yoekt+7 


for each ¢. This number T, which is called the half-life of y, is the number of 
units of time required for half of y to fade away. From (5) we find that $ = eT 
or eT = 2 or —kT = log 2 or 


log 2 _ _ 0.69315. 
ko k 

This formula can be used to determine the half-life T when we know k, and it 

can be used to determine k when we know the half-life. 


8 Supposing that 0 < x S J, observe that 
(1) O Seea, 


(6) T=- 


Replace x by ¢ and integrate over the interval from 0 to x to obtain 
(2) Os ec —1 S eAx. 


Replace x by ¢ and integrate over the interval from 0 to x to obtain 


2 
(3) OSe—-l—-xseF; 
Repeat the process to obtain 

2 3 
(4) OSe-l-x-Fsez 

x? æ xe! 
(5) O8e—-l-e-7-7 2° 
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With or without more attention to details, jump to the conclusion that, for each 
n = 1, 2, 3,° À 3 


xk gntl Anti 
(6) ose- È HS GFI GFN 


While this may not be a suitable time to worry about the details, we can be sure 
that if 4 = 400, then the quantity 

a AAAAA AA A 

! 12345 n—-ilnn+l1 


will be large when n is about 4. The quantity will nevertheless be near 0 when 
n is sufficiently great. Therefore, 


k 2 3 4 
(7) e? = lim $ Haitet+HtatAt e 
While (7) is more spectacular than (6), it is not always as useful. Show that 


putting x = 4 = 1 in (6) gives the formula 


3 


3 1 1 1 
© Ose-(ititgtytgt: +a Sup 


Verify that 


1=1. 
1=1. 
1/2! = 0.5 

1/3! = 0.16666 66666 
1/4! = 0.04166 66666 


1/5! = 0.00833 33333 
1/6! = 0.00138 88888 


and that the next term is obtained by dividing by 7. Continue the work to 
obtain a decimal approximation to ¢ that is correct to 6D (5 decimal places after 
the decimal point). 


9 Supposing that x < 0, observe that 
(1) O0se7 81. 
Replace x by ż and integrate over the interval from x to 0 to obtain 
(2) Osl-—esS —x. 


Replace x by and integrate over the interval from x to 0 to obtain 


(3) OSe—-—1—x <= a 
Repeat the process to obtain 
x? x3 
(4) OSiltety-"38-F 
2 3 4 
(5) Ose-1-%-2-2<% 
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Note that considerations very similar to those in Problem 8 establish validity 


of the formula 


k 2 3 4 


when x <0. For some purposes, (4) and (5) and their extensions are more useful 
than (6). 
10 Have another look at the formula 


2 3 4 
ealtet5Rt+qt+pt 


and remember it. Then write the formula and use it to obtain an approximation 
to e>? which agrees with the idea that (e)? = e. 
11 Prove one of the inequalities 


e>1+x%, log (1 + x) Sx, lgx Sx —l1 


and show that each implies the other two. Hint: If no better idea appears, 
find the minimum value of e? — 1 — x. 

12 Prove that if x > 0 and k is a positive integer, then 
xe o (k+l = (k +1)! 


0 < ez xktl ~ x 
Use this result to show that 


k 
- x 
lim — = 0. 
2— œ ez 


13 We can become accustomed to the formulas 


(1) ns 
2 sitte np l 
(2) 1-: +t t+- +t iZ; 


if we see them often enough. Prove that if ~1 <S x < 1, then integration from 
0 to x gives the formula 


1 x? x’ x” 
(3) logy zz 5sty tzt ty te 
where 

z 4” 

(4) = T; % 
In case 0 S x < 1, show that 
5 xz 4r ntl 
(3) osr f -ë -G@dbDd-w» aF- 
and in case —1 S x < 0, show that 

o p _ jp 1. 
6) Rl =|fo papel sf ral =- r 
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Hence prove that 

(7) log eae teat tet 
when —1 Sx < 1 and that 

(8) lg2=1—-#4+4-E+4-@+°°° 
Show that replacing x by (—x) in (7) gives the formula 


3 4 
(9) log (l +a) =2 -4+5 - -7t 


and that addition gives the formula 


(10) og H = 2(1+5 +55) 


which holds when |x| < 1. Use (10) and a little imagination to obtain the 
formula 


1 1 1 
(11) g2 =2(}+zptzgt o) 


from which we could, with the aid of a calculator or computer, calculate log 2 


correct to many decimal places. Some persons need approximations as good as 
that in 


(12) log 2 = 0.69314 71805 59945 30941. 


If we want log 10, we could get log 8 by multiplying (12) by 3 and then add the 
result to log ($£) which can be calculated from the formula 


(13) leg = 2(5+g55t em tz t*’) 
in which the series converges quite rapidly. The result is 
(14) log 10 = 2.30258 50929 94045 68402. 


{7 
To get log 3, we could let x = $ in (9), but this series converges rather slowly. It 
is much better to calculate log 9 and then log 3 from (14) and the formula 


(15) les = 2(5 taataa ‘) 
This gives 


(16) log 3 = 1.09861 22886 68109 69140. 


14 Elementary combinations of the logarithms of 2, 3, and 10 give the loga- 
rithms of 2, 3, 4, 5, 6, 8, 9, 10, but 7 is missing. Show how the proximity of 
49 to 50 enables us to calculate log 7 with the aid of a series that converges 
rapidly. 

15 This book does not recommend learning a formula for dy/dx when y = 
f(x)", where f and g are given functions. If (as frequently happens in good 
old-fashioned mathematics examinations) we are required to produce the deriva- 
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tive, we can begin by being irked by the fact that the base is not e. There are 
two superficially different ways to put the given equation in a form to which 
basic rules are applicable. Taking logarithms (with base 2) gives 


a) log y = g(x) log f(x) 
and, with or without using (1), we can put the equation in the form 
(2) y = e(z) log f(z), 


When (1) is used, we differentiate to obtain 


ld 
(3) y T = g(x dT 5f (x) + g'(x) log f(x) 


and, with the aid of the fact that y = f(x)9), 

d 
(4) = = f(x) | Se aon x) + g'(x) log fæ) |. 
When (2) is used, we differentiate to obtain 


dy = pfx) log g(x) | 
dx 


(5) 


l 
z g'(x) + f’(x) log g) | 


and then use (2) to obtain the result (4). With the above formulas out of sight, 
use one of the above ideas to obtain a formula for dy/dx when 


f y = x? Ans: 2 = x*(1 + log x) 
dy x — (1 + x) log (1 + x) 
= Wz os = V/z ——_ +O Fe 

(b) y = (1 + x) Ans. = (1 + 2) UF a) 
(c) y = (sin x)sinz Ans.: a = cos x(1 + log sin x)(sin x)si== 
(d) y = (sin x)= Ans.: a = (1 + sec? x log sin x)(sin x)t2* 
(e) y = xez Ans.: 2 = 2xl =~! log x 
(f) y = a Ans.: 2? = abt log b log a 
(g) y = x® Ans.: = Z = hb? log x [ log 6 + =z] 


16 One who is interested in solving a puzzle with an interesting answer may 
undertake to determine the nature of the graph of the equation y = x”. One 
who has never thought about the orders of magnitudes of the numbers (0.01)? ° 
and ($) may even be surprised by the results. 

17 Ifthe result has not already been obtained, let y = x7 and show that 


y” (x) = x2(1 + log x)? + x27! > 0. 


Tell what this says about the graph of y = x*. 
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18 Derive the formula 


1 
| qe =e loge $1) $e 


with the aid of the identity 
1 eple er o 
et] e+] ew +1 
19 Letting F(0) = 0 and F(x) = e=: when x Æ 0, show that the formula 


=1— 


(1) Fu (x) = Pole) ell? 


holds when x =Æ 0, k = 1, and P(x) = 2. Supposing that (1) is valid when £ is 
a given positive integer n and P,(x) is a particular polynomial in x, differentiate 
(1) to obtain a formula which shows that (1) holds when k = n + 1 and 


(2) Paal) = (2 — 3nx®)Pa(x) — x°P,(x), 


so that Pas; is a polynomial in x. This shows (the principle involved being called 
mathematical induction) that (1) holds for each k, Px being a polynomial in x. 

20 Prove that if P and Q are polynomials in x for which Q(x) is not always 
zero, then 


(1) lim ay gi = Q, 


r—0 x 
Solution: We can determine constants such that be Æ 0, q ïs an integer, and 


(2) P(x) — ao + A% + + anx™ 


e 


O) botbw t Foe ™ 


It therefore suffices to prove that 


(3) lim |x|¢e-V/2* = 0, 
z—0 


or, as we see by setting ¢ = 1/x?, 


(4) lim ¢7@/2g7t = 


i-> œ 


But when ż > 1 and k is a positive integer for which —q¢/2 < k, 
tk 
(5) [e-alBe—t] < te = Pi 


Our conclusion is therefore a consequence of the formula 


(6) lim E =0 


to e 
which is proved in Problem 12. 
2I Let F be the function for which f(0) = 0 and 


(1) F(x) = Yz 


when x = 0. With the aid of results of the two preceding problems, prove that 
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F and each of the derivatives F™, n = 1, 2, 3, -+ - , is continuous over the 
whole infinite interval — œ < x < œ and, moreover, 
(2) F(0) = 0 (n = 1, 2,3, - +»). 


Remark: This is one of the famous functions of mathematical analysis. Note 
that we have no formulas into which we can put x = 0 to obtain the numbers 
F (0). After we have learned that F“(0) = 0 when k = n, we must use the 
definition 


F+0(0) = lim PO (x) — F (0) 


20 x 
to learn that F)(0) = 0 when k =n +1. 
22 Supposing that ai, ae, az, ` - > are positive numbers, that n is an integer 


for which n > 1, that x > 0, and that 
airt ast >: + anitx 


(1) Fal) = FSS 
Qian ° °> ° An-1% 


prove that fa(x) is a minimum when and only when 


„untat ss + anı, 


n— 1 


(2) x 


Hint: Use the fact that fa(x) is a minimum when log f,(x) is a minimum. Show 
that 
d 
G) F log fala) 
_ n—1 art aot +++ + ana], 
= aatar paral- n— 1 | 


Remark: Qur result shows that 


(4) Fn (e a) = Jalan) 
and that equality holds only when a, is the arithmetic mean (or ordinary average) 
of the numbers aj, do, * * * , @n-1. Interest in this matter can start to develop 
when we observe that 

Qtattt: tan 


n 

me 

Waa, °° an 

and hence that fn(an) is the ratio of the arithmetic mean of the numbers a, @2, 


` , An to the geometric mean of the same numbers. By using (1) to obtain 
an expression for the left member of the formula 


ai + a + e. + an1 (n~1)fn 
atat ter n—1 


n—1 (aja a ee an-1) "7D 
= [fn-1 (an1) ] 072 


and applying some quite elementary algebraic operations to simplify the result, 
we can derive the formula. From (6) and (4) we obtain the formula 


(7) [Sa-i (án1) "7! S [fa (4n)]". 


(5) Jalan) = 


ORAI 
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Since f:i(a:) = 1, this gives the remarkable parade of inequalities 


a, + aN? a; + az + a;3\3 a; + az + az + ag\! 
9 1s\—2_]} <\__4._) «<\|__ __»—_ 
NV a\a2 V 414243 V 41490304 


The denominator in these quotients cannot exceed the numerators, and therefore 


(9) vya a, < GEE Ea a= 123.. 3. 


n 


IA 


The simple corollary (9) of the more spectacular result (8) is a statement of a 
very famous theorem which says that if ai, do, * * * , Qn are positive numbers, 
then their geometric mean is less than or equal to their arithmetic mean. Our work 
enables us to show that equality holds only when the numbers ai, az, ` * > , Qn are 
equal. Many proofs of this theorem are known, and it is quite appropriate to 
become interested in the matter by looking at the special case 


(10) 27a;aza; £ (a1 + a2 + a3)? 


and seeking ways to prove it. Sometimes scientists say that boys work with 
equalities and men work with inequalities. 
23 For those who are interested in the matter, we present a direct proof that 


(1) lim f(x) = e when f(x) = (1 + ly 


[zl œ 
When 7 is a positive integer, putting a = 1 and b = 1/m in the binomial formula 


(2) (a + b)” = gnpo + nar 1p! + nn =I ar~ 2þ2 


4 mn ~The gr-3B3 p - 4 nn — Tm A 2 apn 


gives 


3) fn) =14+145(1-4)4+4(1-2)(1-5) 
taG-DO-DO-ato 
0-90- 


1 


n! 
1 1 1 
<l¢(l+5t yt +R) <3 


It follows from Theorem 5.65 that the series in 


Hence 


(4) fe) S1+14+Q4+q4-°- 


(5) ft) <1414+n4¢q4¢q¢ °° 56 


converges to a number which we can call z and that (5) holds. If m > n, then 
replacing n by m in (3) shows that f(m) consists of (m + 1) terms of which the 
first (n + 1) equal or exceed those in the right member of (3) and the remaining 
m — n terms are positive. Hence f(m) > f(n) when m >n. As we can see 
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by putting sn = f(n) in Theorem 5.651, this and (5) imply existence of a number 
L for which 


(6) lim f(n) =L Se. 


N—> 00 


To prove that L = e, we can use (3) and (5) to obtain 
i¢1494(0.-2)+90-DO-2)+-- 
AEDES (2S) ene 


when n > N. Letting n—> œ gives 


1 
(7) L+lt+atat- ta Slse 


and letting N —> © shows that L = e. Supposing now that x > 2 and that x 
is not necessarily an integer, we can let n be the greatest integer in x so that 
n<xx2in-+1. Then it is easily seen that 


(8) (G+ sosy 


The first and last members of this inequality converge to e as n— œ because 
they are respectively equal to 


® (toa) Ctra) C+ Cts) 


and the second factors converge tol as n— œ. Therefore, f(x) — e as x — ©. 
Proof of the part of (1) involving negative values of x is provided by the calculation 


. 1 
(10) im 1+=) = lim (1-3) = lim 


zI— © 


-5 


= é. 


x2— 0 x— i) 2—> 0 


= lim (1 + = lim 4 


9.3 Hyperbolic functions We begin with a peek at some formulas 
that are very useful in more advanced mathematics and will appear again 
in Section 12.4. The formula 


2 3 4 
(9.31) e=ltetataytat 


which has been proved to be valid when z is a real number, is used to 
define e7 when z is a complex number of the form z = x + ty, where x 
and y are real and i is the so-called imaginary unit for which 1? = —1. 
Since 72 = —1, 78 = —21, 74 = 1,25 = 1, it can be shown that 


, g? gt zê 
(9.311) = (1 -nata gt oe ) 


. 3 g5 g1 
r(e-grg-4t) 


? 
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and hence that 


(9.312) ev 
(9.313) "B 


cos z +72 sin z 
cos z — 1 SiN zZ. 


Adding and subtracting give 
(9.314) COS z = one, sin z = — £. 


These are Euler (1707-1783) formulas. They are widely used, particu- 
larly in electrical engineering, to replace calculations involving sines, 
cosines, and their positive powers by simpler calculations involving expo- 
nentials. We do not need to understand these matters now, but we can 
at least acquire a vague feeling that trigonometric functions are related 
to exponential functions by formulas very similar to those which relate 
hyperbolic functions to exponential functions. In any case, we can know 
that there is a reason why formulas involving hyperbolic functions are 
so similar to formulas involving trigonometric functions. Modern 
scientists know that their ancestors complicated many problems by 
habitually using trigonometric and hyperbolic functions in situations in 
which results are obtained much more neatly and quickly by use of 
exponentials. Thus hyperbolic functions are introduced to students 
with the hope that they will (insofar as they can control their own 
activities) use the functions only for purposes for which they are useful. 

We now return to our usual situation in which all numbers are real. 
The hyperbolic sine, hyperbolic cosine, and other hyperbolic functions are 
defined by the first of the following equations, and calculation of the 
derivatives gives practice in differentiation. 


(9.32) sinha = A 


, L sinh x = cosh x 


2 
(9.321) cosh x ZE, 4 cosh x = sinh x 


dx 
(9.322) tanh x = en L tanh x = sech? x 
(9.323) coth x = ate L coth x = — csch? x 
(9.324) sech x = -a z sech x = — sech x tanh x 
(9.325) csch x = 2, che = csch x coth x 


et — emt dx 


These and many other formulas are very similar to trigonometric for- 
mulas, but differences in signs must be noted. As is the case for trigo- 
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nometric functions, the first and last are reciprocals, the next to the first 
and the next to the last are reciprocals, and the middle two are reciprocals. 
With the aid of the fact that e*e-* = 1, we can square cosh x and sinh x 
and obtain the first of the formulas 


(9.33) cosh? x — sinh? x = 1 
(9.331) tanh? x + sech? x = 1 
(9.332) coth? x — csch? x = 1. 


To obtain the second (or third) formula, we can divide by cosh? x (or 
by sinh? x) and transpose some terms. Graphs of the first three hyper- 
bolic functions are shown in Figures 9.34 and 9.341, and the others are 
easily drawn. 


Figure 9.34 Figure 9.341 
To work out formulas for the inverses of the first hyperbolic functions, 


we let 


t —t t— a 
(9.35) x = cosh t = 2 TE, y = sinh t = Ê A 


and observe that cosh ż is increasing over the interval ¢ 2 0 and sinh ż 
is increasing over the whole infinite interval. The equations (9.35) can 
be put in the forms 


(9.351) e% — 2x + 1 = 0, e% — Dyet ~ 1 = 0. 
These equations are quadratic in ¢*, and solving for e gives 


(9.352) e=x+ Vx? —1, e=ytvyt+ 1, 


it being necessary to choose the positive sign in each case because ¢ 
is positive and increasing as x and y increase. Taking logarithms and 
changing y to x gives the first items in the first two of the following for- 
mulas. Similar methods and differentiation give the remaining items. 
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(9.36)  sinh™? x = log (x + Vx’ + 1), join e5 Er 


(9.361) cosh“! x = log (x + Wx? — 1), z cosh" x = = 
(x > 1 
(9.362) tanh! x = 4 log — Z tanh" x= a | 
(|x| < 1 
(9.363) coth! x = 3 log ~ + i, = coth x= = | 
(x| > 1 
(9.364) sech7! x = log I+vi-x Lg, — sech™! x = Ei 
O0 <x<l) 
(9.365) csch! x = log L+ VIF £ esch x= == 
(x > 0) 


Because of the growing tendency to use exponentials to eliminate 
calculations involving hyperbolic functions and even trigonometric func- 
tions, it seems unwise to devote more time to the formal aspects of the 
subject. It is, however, of interest to know how hyperbolic functions 
are related to hyperbolas. Setting 


t —t t — p—™~t 
(9.37) x = cosh t = E, y = sinh z = Ê 7 


we see that x > 0 and x? — y? = 1, so P(x,y) lies on the branch of the 
rectangular hyperbola shown in Figure 9.371. A reasonable way to try 


Figure 9.371 Figure 9.372 
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to discover the role of the parameter ż is to let r be the vector running from 
the origin to a particle which occupies the position (x(t),y(r)) at time z. 
Then 


(9.381) r = cosh “i + sinh żj 
and differentiation gives the velocity and acceleration 


(9.382) v = sinh żj + cosh gj 
(9.383) a = cosh #i + sinh #. 


Thus a = r, so the particle is accelerated directly away from the origin, 
and the magnitude of the acceleration is proportional to (actually equal 
to) the first power of the distance. It is then known from classical 
physics that the particle must (as it does) move on a conic. Moreover, 
because the force is a central force which is always directed away from 
or toward the origin, the angular momentum of the particle must be 
constant and the vector from O to P must sweep over regions of equal 
area in equal time intervals. With (or perhaps even without) the aid 
of principles of physics that tell us to examine areas, we can calculate 
the area of the shaded region of Figure 9.371 and show that the area is 
cosh—! x and hence is ¢t. The problems show how the details can be 
handled. Thus the geometrical similarity between trigonometric func- 
tions (which used to be called circular functions) and hyperbolic functions 
(which still are called hyperbolic functions) is exposed. Trigonometric 
functions of ¢ are “functions of the sector of area t of the unit circle shown 
in Figure 9.372.” Hyperbolic functions of ¢ are “functions of the sector 
of area ¢ of the unit hyperbola shown in Figure 9.371.” Those who have 
not peered into ancient mathematical tomes and are more accustomed 
to “‘sines of angles” than to “sines of arcs” and “‘sines of sectors” should 
quietly observe that the sector of the unit circle of Figure 9.372 has 
area t when the arc from 4 to P has length ¢ and hence when the angle 
AOP “contains” ¢ radians. Among other things, this little excursion 
into history explains the antics of those who write arcsin x in place of 
sin-! x and talk about “the arc whose sine is x.” 


Problems 9.39 


1 Verify the six formulas for derivatives of hyperbolic functions. 

2 Verify the six formulas for derivatives of inverse hyperbolic functions. 

3 Textbooks on differential equations show that if a flexible homogeneous 
cable or chain is suspended from its two ends and sags under the influence of a 
parallel force field (an idealized gravitational field) then the cable or chain must 
occupy a part of the graph of the equation 


(1) y= 5 (erz 4 ght) = T cosh kx, 
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provided that the positive constant & and the coordinate system are suitably 
determined. Because of this fact and the fact that the Latin word for chain is 
catenus, the graph of (1) appearing in Figure 9.391 is called a catenary. Supposing 


y 


O x 
Figure 9.391 


that a > 0, find the length Z of the part of this catenary that hangs over the 
1 
interval 0 Sx Sa. Ans: L = Z sinh ka. 


4 Compare the graphs of y = sech x and y = 1/(1 + x?). 
5 Starting with the formula 


t = log (x + Vx? — 1), 
show that x = cosh ?. 
6 Prove the formula 


[ Veni dx = $x Vx? — 1— $ cosh! x + c 


with the aid of (8.488) and use it to show that 
cosh™! x9 = 2 EE Vx —-1- [Pv x? — | dx. 


Use this to show that cosh! x is the area of the shaded region of Figure 9.371 
when x = xo. 


7 Evaluate the integral of the preceding problem with the aid of a hyper- 
bolic function substitution. 
8 Show that 
e7 = cosh x + sinh x 
eT? = cosh x — sinh x. 


Show that if there exist constants cı and c for which 

(1) F(t) = eye! 4 c0, 
then there also exist constants Cı and C, for which 

(2) f(t) = e[C, cosh bt + C: sinh di]. 


Use (1) to calculate formulas for f’(#), f’(#), f’"(t), and f(t). Use (2) to cal- 
culate f(z) and f’’(#) and observe that the hyperbolic functions are being nuisances. 
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9 Supposing that a and b are constants for which b? < a’, obtain the formula 


at 7 — 
f sat cosh bt di = E (b ih he- < cosh bt) +e 


by integrating by parts twice. For the case where b? = a? see the next problem. 
10 Considering separately the three cases in which b? x a?, b = a, and b = 
—a, evaluate the integral 


4 J [elat + efa->)t] dt 


and put the answers in terms of hyperbolic functions. 
11 Determine whether 


(a) lim [ "dt = f "eh dy 
a-—I1 Jil 1 
(b) lim f7 ert cosh bt dt = f 7 et cosh at di 
ba JO 0 


9.4 Partial fractions Let 


(9.411) f(x) =x? — 2x +1 + — + 


x l 
rs ro 


Use of basic integration formulas then gives 


(9.412) [fe de =Ë tet 2 log le — 1| + $ log (x? + 1) 

+ tanix + ca 
Adding the terms in the right member of (9.411) gives 
x5 — 3x4 + 4x? — x? + 3x 


xi — x? +4 — ] 


(9.413) fx) = 


and shows us that the integral 


5 _ 2nd 3 yd 
(9.414) J? 3x4 + 4x x + 3x gy 


x— x? +x — 1 


is equal to the right member of (9.412). Interest in this business starts 
to develop when we wonder how we would evaluate the integral (9.414) 
if it were handed to us without the preceding formulas. The answer to 
this question is quite straightforward. We learn and use a procedure 
by which the preceding formulas can be worked out. 

The first step is to look at (9.414). The integrand is a quotient of 
polynomials in x, and the degree of the numerator is not less than the 
degree of the denominator. In such cases we employ division (or long 
division) to obtain a polynomial and a new quotient in which the degree 
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of the numerator is less than that of the denominator. Letting f(x) be 
defined by (9.413), we divide to obtain 


3x? + 1 
(9.42) f) =x? -~de+ 1+ aoa TT’ 


Our difficulties will have been surmounted when we succeed in expressing 
the last quotient (or fraction) in (9.42) as a sum of simpler quotients (or 
fractions) that are called partial fractions. ‘This brings us to the key 
problem of this section, namely, the problem of representing a given 
proper rational function (that is, a quotient of polynomials in which the 
degree of the numerator is less than that of the denominator) in terms of 
partial fractions. This key problem is important outside as well as inside 
the calculus. The problem is treated in some algebra books, but students 
normally make their first acquaintance with the problem in calculus 
books. 

To begin discovery of the partial fractions whose sum is a given proper 
rational fraction, we must factor the given denominator and use these 
factors to determine the nature of the partial fractions. Electronic 
computers are often used to factor denominators, but factoring of the 


denominator of the quotient in (9.42) is quite easy if we happen to notice 
that 


xs — ee + x l [= (x — 1) + (x — 1) = (x — 1)(x2? + 1). 
Thus the quotient in (9.42) is the left member of the equality 


3x2 + 1 _ A Bex +C 
(9.43) G@-D)@+Fl~x-1' +1 


While we are entitled to be quite mystified by the fact until the matter 
has been investigated, it is possible to determine three constants 4, B,C 
such that (9.43) is true for each x for which the denominators are all 
different from zero. In fact, we can clear the denominators from (9.43) 
and determine the constants so that the formula 


(9.431) 3x2 + 1 = A(x? + 1) + (Bx + C)(x — 1) 


is true for each x. One way to prove this and to find the constants is to 


put (9.431) in the form 

3x2 + 1 = (4 + B)x? + (—B + C)x+ (4 — C) 
and to observe that this is surely an identity in x if 

A+B =3, —-B+C=0Q, A-—-C=1 
and hence (as we show by solving these equations) if 


(9.432) A=2, B=1, Cel. 
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There is, however, an easier way to find 4, B, C directly from (9.431). 
Putting x = 1 in (9.431) shows immediately that 4 = 4(2) and hence 
that 4 =2. Putting x =0 shows that 1 = 4—C or 1=2-C, 
so C =1. Finally, putting x =2 shows that 13 = 2-5 + (2B + 1), 
so B = 1. Substituting from (9.432) into (9.43) gives 


3x? + 1 x+1 


2 
(3.433) G Ney tA T 


Since this is the last quotient in (9.42), we obtain (9.411). Thus we 
have succeeded in writing the given quotient in (9.413) as the sum of a 
polynomial and partial fractions. 

Some problems are easier than the one we have solved, and some are 
more difficult. When we wish to integrate the left member of 


2x +6 _ A 4 B 
(x-a +D x-1I°x ST 


we determine 4 and B so that (9.44) and 


(9.44) 


(9.441) 2x +6 = A(x +1) + Bix — 1) 
hold. Putting x = 1 shows that 4 = 4, and putting x = —1 shows 
that B = —2. Thus 

2x + 6 4 2 


(@—1(e+tl) x-1 FI 


and integration gives 


[ FRB de = 4 log le = 1 = 2log ie + 1 e. 


x? — 1 


The result can, for better or for worse, be put in the form 


2x +6, _ (x — 1)! 
|Z a = Ehe 


The problems at the end of this section provide additional clues to 
methods by which quotients are expressed in terms of partial fractions. 
Problem 11 proposes study of basic theory, and persons interested in 
more than the simplest mechanical aspects of our subject can do this 
studying at any time and perhaps even more than once. 


Problems 9.49 
I Show that, when p ¥ q and a > 0, 
l 1 x— 4 
1 ——___________ qx = —— lo +c. 
o | opent ag o FS 
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Remark: We should not be too busy to see how this formula leads to another 
that also appears in tables of integrals. Let 


(2) X=avrr’+bete, 

where a, b, c are constants for which a > 0 and b? — 4ac > 0. Observe that 
(3) X = a(x — p)(* — 9), 

where b and q are the values of x for which X = 0, so that 


p — 4 —b — Vb? — hac /b? Fa 
4) paty” te acho h cie, p qn Ve 


Substituting in (1) gives the formula 
(5) Iz dx = og | ae tb = VE A Bae) 
X WE” S | Zax + b + VE — 4ac 
We should know that additional tricks produce additional formulas that appear 


in tables of integrals. When a > 0, integrating the first and last members of 
the identity 


(6) x 1 (2ax+b)—b _ 1 2ax+d b 1 


ax? + be te = 2a ax? + be te ~ Ja ax? + bx te — Qaax? + bx Fe 


gives the integral formula 


b 1 


When X is defined by (2) and b? — 4ac Æ 0 and n ¥ 1, differentiation and sim- 
plification give 


(8) d lax +b _ _ 2a(2n — 3) _ (n — 1)(0 — 4ac) 
dx X* 1 pe. Fn 
and hence 
1 2ax + b 2a(2n — 3) 
9 + dgy = — —— “AT? 
( ) J X” dx (n — 1)(%? — 4ac) X”! (n — 1) (3? _ 4ac) f Yri dx 


The formulas (7) and (9) are examples of reduction formulas that sometimes 
enable us to express given integrals in terms of other integrals that are more 
easily evaluated. Persons who like everything in mathematics can easily become 
interested in (9) and similar formulas that appear in books of tables. But the 
formulas are rarely used (most people never use them), and with only a twinge 
of regret we decline to invest our good time in consideration of examples more or 


less like f(x? — 5x — 1)? dx. 
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2 Supposing that p, g, r are three different constants, determine 4, B, C 


such that 
1 


-pe-e 


4, 8 , ¢ 


x—-p x—g@ 


xr 


and use the result to obtain the integral with respect to x of the left member. 


Ans. . 


l ; log le — p| + 


(L-9) -r 


ee 
(1 — p-r 


y log |x — ql] 


log |x — r| +c. 


1 
tope- i 


3 Supposing that p Æ q, determine 4, B, C such that 


1 _ A B C 
@ NG) 2-—p GP ea 
and use the result to obtain the integral with respect to x of the left member. 
Ans.: 
- gps le — pl tot tt tog le - al + e 
(4 — 2)? —px—-—p (9-9)? 


4 Obtain the answer to Problem 3 by starting with the tricky calculation 


1 1 
(x — p)*(x — 4) 


= l|  ¢@-p _ 1 @~?)- 
17 Pa pe g) 


(x — q) 
q—p (x — p)*(* — 4) 


- 3 le=ne=a Gl 


5 Assuming that 9, q, and r are different constants, find the partial fraction 
expansions of the following quotients and check your answers. 


(2) = 
x 

9 E=NE-D 
(e) ——._—*_____.. 

(x — 1)(x — 2)(x — 3) 
e) —— * 
8’ G@ — Dia — 2) — 3) 
O) i 

(x — 1)(x — 2)? 

6 Show that i 

f t(1 + 1)? 

7 Show that 
(1) iz 


O =a 
ETET) 
O Gope- Ne 
® GE DEA 
D GETT 
dt = log 2 — L. 


2 


D ty, me E. 
o FDA 6 
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Remark: Determination of constants 4, B, C, D for which 


1 _Ax+B,Cxt+D 
(x? + 1)(x? + 4) verti x?+ 4 

or 

(2) 1 = (Ax + B)(x? + 4) + (Cx + D(H? + 1) 


can be made in various ways. Particularly efficient work can be done by those 
fortunate persons who know about the algebra of complex numbers, includ- 
ing the “imaginary unit” i for which i? = —1. Putting x = í in (2) gives 
1 = (4i + B)\(3), so 4 =0 and B =4. Putting x = 2i in (2) gives 1 = 
(2Ci + D)(—3), so C = 0 and D = —§. Hence 


1 1 1 1 il 


ae ë O ë 


+ DeE+tA 327+1 32°44 


a result that is easily checked. 
8 Obtain the partial fraction expansion of the left member of the formula 


x ieta- _ letiet., 
(x + 2)* (x + p)§ (x + p)° 


by taking advantage of the broad hint in the formula. 
9 We should not be too busy to see how Euler determined the constants in 


x? _ A B C Dx + E 
1) ASFA o O—-w fox? TF 


in his great textbook “Introductio in Analysin Infinitorum,” Lausannae, 1748, 
volume 1, page 31. Clear of fractions to obtain 


(2) x? = A1 +x?) + BL — x)(1 + x?) 
+ C(1 — x)*(1 + x?) + (Dx + E\)(1 — x). 


Put x = 1 to obtain 4 = 4%. Subtract $(1 + x”) from both members of (2) 
and divide by (1 — x) to obtain 


(3) —3x — 4 = BI + x?) + C(1 — x)(1 + x?) + (Dx + E)(1 — x)?. 
Put x = 1 to obtain B = —%. Add $(1 + x?) and divide by (1 — x) to obtain 


(4) —gx = C(1 + x°) + (Dx + E)(1 — x). 

Put x = 1 to obtain C = —%. Add $(1 + x?) and divide by (1 — x) to obtain 
(5) ił — įłx = Dx + E, 

and hence D = —4, E = Ł, 


10 One who wishes to think about a partial fraction problem in arithmetic 
(or theory of numbers) may seek integers 4 and B for which 


ll £ B 
p73 ts 
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11 Partial fraction expansions have their principal applications in electrical 
engineering and elsewhere where complex numbers (numbers of the form a + 1b, 
where a and b are real and z is the imaginary unit for which 72 = —1) are system- 
atically used. When complex numbers are used, it is not necessary to bother 
with factors (like x? + 1 and x? + x + 1) that cannot be factored into real 
linear factors. Theory and applications can be based upon a version of the 
fundamental theorem of algebra which says that if 


(1) P(x) = a” + ax! + +++ taux t+ an, 


where n is a positive integer and a) Æ 0, then there exist & different numbers 
x1, X% °° * , Xk (which are not necessarily real) and k exponents 1, px, °° - 
px (which are positive integers) such that 


3 


(2) Pit pet s+ t+pran 
and 
(3) P(x) = a(x — xı)? (x — %2)P? e. (x _— Xk) Pe. 


This theorem is a simple corollary of general theorems that appear in the theory 
of functions of a complex variable. Less sophisticated but more complicated 
proofs can be given. The numbers xı, %2, © * - , x, are said to be zeros of the 
polynomial of multiplicities pı, po, °° * , px It can be proved that if Q is a 
polynomial of degree less than that of P, then Q(x)/P(x) is representable in the 
form 


Q(x) _ Ay Ais Ay,3 oo. Aip 
O PA G-a eo @—at E G apn 
Aon Ao» Ass oi. Ae ps 
+ (x — x2) + (x — x)? + (x — x2) + + (x — x)”: 
+- 
Akı Ark.2 Ax,3 a. Arp: 
+ (x — xr) + (x — x)? + (x — x)? + + (x — xy)”: 


where the pı constants 41,1, 41,2, ° * * ,4ı,pı “go with the powers of the factor 
(x — x;),” the bz constants 421, 42,2, °° * , 42, “go with the powers of the 
factor (x — x2),’? etcetera. Supply and demand do not generate heavy traffic 
in proofs of this partial fraction theorem, but we can pause to look at the three 
potent identities 


©) x — a S(x) x—a S(x) 

(6 1 R@) 1 (x — a) — (x — b) R(x) 
) (x — a(x — b) S(x) b~a (x—a(x— b) S(x) 
nd 

° x R(x) 1 d(x — a) — alx — b) R(x) 


(7) G— au —b Sx) ha ae Se) 
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in which it is supposed that a and b are real or complex numbers for which b xq. 
For example, use of (7) gives 


— ooo oaa F o 
D Gae E a) ~G@—a@—m) e aa mp 
— Lee — x) — x(x — x2) x’ 
— xa — x (x — x1)(x — x2) (x — x)(x — x3)3 


1 [ X x) ] x4 
 ¥9— %1LxX — x> x — xid (x — x) (x — x3) 
Xo x4 xı x4 


x2 — xı (x — x)? (x — x3)  xz— 1 (% — xı) (x — x)(x — x)? 


Even when Q(x) and R(x) are values of functions that are not polynomials, (5), 
(6), and (7) enable us to express quotients as sums of simpler quotients. It is 
quite easy to see that the partial fraction theorem can be proved by repeated 
applications of (5), (6), and (7). Moreover, it is sometimes better to use (5) 
and (6) and (7) than to use other methods for obtaining partial fraction expan- 
sions. Textbooks in algebra show that if the coefficients ao, a1, °° * , apin 
(1) are real, then the zeros of the polynomial “come in conjugate pairs.” This 
means that if p + iq is a zero for which p and ç are real and g 0, then p — iq 
is another zero. On account of this fact the right side of (4) can, when P is real, 
be represented as a sum of terms of the forms 


9 B D Gx 
9) @—O” FEFA" F Er FA 

where m is a positive integer and the coefficients in the denominators are all 
real. The quadratic denominators are all real. The quadratic denominators 


arise because if Hı and He are constants, then there exist other constants H; and 
H, for which 


(10) H, H: H: + A yx 


Ő D o 
— e 


In most practical applications of this material, the numbers Dis Po s+ + > Dk 
defined above are all 1 and (3) and (4) reduce to the much simpler formulas 


(11) P(x) = aox — x)(x — x2) - © © (x — xn) 
Q(x) A As An 
(12) P@)"z-mtzomt tTa 


Except when P(x) has only real zeros, we must use complex numbers to achieve 
simplicity. 


9.5 Integration by parts In this book, and elsewhere in the scientific 
world, we frequently encounter situations where effective use can be 
made of the formula for integration by parts. Assuming that u and v 
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are functions having continuous derivatives over intervals appearing in 
our work, the formula which can be put in the forms 


d d d 

aw sus bos d(uv) = u du + v du 
d d d 

“= = = oT u dv = d(uv) — v du 


implies the formula which can be put in the forms 


d 
(9.51) feiu- [oes f uws w- fodau 


(9.52) f uwo dx = u(x)o(x) — | eu) dx 


which involve the notations of Leibniz and Newton. We can prefer 
to use (9.52) when meanings of symbols are being explained and to use 
(9.51) when problems are being solved and the abbreviated notation 
expedites our work without confusing us. 

We have already seen some of the reasons why the formula (9.51) or 
(9.52) for integration by parts is useful. As was pointed out following 
(8.483), efficient use of the formula is made by writing 


(9.53) u = u(x), dv = v'(x) dx 
(9.54) du = u'(x) dx, v = fo'(x) dx = (x), 


where u(x) and v'(x) are chosen in such a way that the product u(x)v (x) 
is the integrand in the integral we wish to study. The integral of the 
product of the things in (9.53) is then the product of the things on the 
main diagonal minus the integral of the product of the things in (9.54). 

The formula for integration by parts has so many applications that it 
is quite hopeless to undertake to tell when and how it is useful. In 
many (but not all) situations, the formula is useful when u and o’ are 
chosen in such a way that fo’(x) dx is an elementary integral and the 
integral fo(x)u’ (x) dx is simpler than the integral fu(x)o’(x) dx. Our 
examples and problems will provide some ideas and information. Mean- 
while, our guiding principle merits repetition. If we want to learn some- 
thing about an integral and other methods fail to be helpful, we try 
integration by parts. Before turning to examples, we make a final 
observation. In order to apply the formula (9.52) for integration by 
parts, we need just one pair of functions u and v for which u(x)v’ (x) is a 
given integrand. It is therefore not necessary to insert an added con- 
stant cı of integration when we write a function v whose derivative is v’. 
One who wishes to do so may see what happens when we replace o = —677 
by v = —e-* + cı in the following example. There are relatively few 
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situations in which matters are simplified by inserting a cı which is 
different from Q. 


Letting 


(9.55) I, = fxe—* dx, 


we set 
“=X, dy = e~ dx 


du = 1 dx, v = fe dx = =e 


and conclude that 
Ty = ~x + fe dx = —xe — e= +c. 


The same idea is useful when n is a positive integer and 


(9.56) In = [xe dx. 
Setting 
u = x", dv = e dx 
du = nx”! dx, o = fe dx = —e* 


gives the reduction formula 
In = — xe + nfa le dx 
which expresses J, in terms of I„—ı. In particular, 
fxe dx = =x? — Ixe* — 2e + ce. 


If is a positive integer and 


(9.57) Jn = Sx” sin x dx, 
setting 
“= x", do = sin x dx 
du = nx"! dx, v = f sin xdx = — cos x 


gives the formula 
Jn = —x" cos x + nfx"—! cos x dx. 


If n = 1, the last integral is easily evaluated. In case n > 1, we can 
integrate by parts again. Setting 


u = xr), dv = cos x dx 
du = (n — 1)x*-? dx, v = f cos x dx = sin x 
gives 
Jn = ~x" cos x + nx"! sin x — n(n — 1)fx"-? sin x dx. 


In particular, 


fx? sin x dx = —x? cos x + 2x% sin x +2 cos x + c. 


9.5 Integration by parts 521 


We could feel that integration by parts would not enable us to simplify 
f log x dx, but we can set 


= log x, dy = 1 dx 


du=+dx, 4 
x 


x 


to obtain 
J log x dx = x log x — fl dx = x log x — x + c. 


This result is easily checked by differentiation. 


Problems 9.59 


1 Derive the following formulas by one or more integrations by parts 

1 

(a) f xe dx = ¢ — =) eo + ¢ 
2 2 

(b) [ = log x ax = F log x -5 +c 

xrtl 1 
O fx log x dx = =T TONERI 

2 
(d) [ wee ae = x SPE ect +e 
(e) J = sec? x de = x tan x + log cos x +c 
(f) f (log x)" dx = x(log x)” — n f (log x)” dx 
log x x log x . 
© | oi 41 =yq1 ~lset)te 
(h) f in x de = x sinx + vl -x+c 
(1) J tan™! x dx = x tan! x — $ log (1 +2) +c 
2 Setting u = 77, do = (1 — t)! dt, derive the formula 
f ; (1 — i) dt -2 j) ' tP] — zat! dy 
0 q +1jo l 
Observe the fact that the result agrees with the beta integral formula 
1 plq! 
f t(l — ż)t dt = TETESI 


and the formula r! = r[(r — 1)!]. Remark: In Problems 18 and 19 of Problems 
13.49, we shall introduce the Euler gamma integral formula 


z! = I tzet dt (z > —1) 


and show how it is used to derive the Euler beta integral formula. 
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3 Derive the formula 
fx” cos x dx = x” sin x + nx”! cos x — n(n — 1) fx"? cos x dx. 


Use this formula to find fx? cos x dx, and check the result by differentiation. 
This formula and those of the next two problems are reduction formulas. In 
some Cases, useful results are obtained by making repeated application of them. 
4 Letting 
I(p,q) = f sin? x cos? x dx, 


where p and g are constants for which p = —1 and p + g ¥ 0, show that 


‘pti gl 
I(p,9) = ro + ore i if sin?t? y cost? x dx 


and 
sin?*! x cos™ 1x  g—] 


ls a ee i -2 
I(p,q) = a+q tF sin? x cos??? x dx. 


Hint: Start by writing u = cost! x, do = sin? x cos x dx. 
5 Letting 
W (p,q) = J sin? x cos? x dx, 


where p and g are constants for which g = —1 and p + q ¥ 0, show that 
sin?! x costly p—i a 49 
W (t,q) = — ~; Fi tF] sin?—2 y cost? x dx 


and 


sin?! x costy | p — 1 


W (0,9) = >F? P po sin?™? x cos? x dx. 


Hint: Start by writing u = sin?-! x, do = cos? x sin x dx. 
6 Supposing that p is an integer for which p = 2, show how a result of the 
preceding problem can be used to obtain the formula 


x/2 p ee D 
(1) J sin?” x dx = 7 o sin?™? x dx. 


Supposing that n is a positive integer, show how repeated applications of (1) 
give the famous Wallis (1616-1703) formulas 


x/2 | 2n — 1 2n — 3 531 ¢ 
an -p e e © © =æ @ a © oa ô =e 
(2) J, sin® x dx = Fo a5 64°33 
x/2 2n 2n — 2 6 42 
inertl = ———_. e o o men ome 
(3) J, sintet! æ dx 2n +1 2n—1 75 3 


Observe that 


(4) 2n(2n — 2) - - - 6-4-2 = 2"! 
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and show that multiplying the numerators and denominators of the right mem- 
bers of (2) and (3) by the left member of (4) gives the formulas 


z/2 (2n)! x 
fed 2n — I‘ 
(5) J, sin?" x dx = Fon tnl 3 
T/2 . 22ny In! 1 
2n+1 = Le 
(6) Í, sinti x dx = l In Fl 


One reason for interest in these things lies in the fact that the number 


(2n)! 


2nnin! 


is the probability of finding exactly n heads and exactly n tails when 2” coins are 
tossed. Remark: We embark on a little excursion to see that these formulas 
have startling consequences. When 0 < x < 2/2, we have 0 < sin x < 1, so 


(7) 0 < sin*t! x < sin?” x < sin l y <1 
and hence 
r/2 . r/2 x/2 . 
(8) 0 < f sint! x dx < Í, sin?” x dx < I sin?! x dx. 


Putting p = 2n + 1 in (1) gives the formula 


r/2 
(9) f sin?! x dx = (1 + 5) fe sin2"+1 y dx. 


It follows from (8) and (9) that there is a number 6, for which 0 < ĝa < 1 and 


x/2 
(10) Í, sin?” x dx = (1 + =) i sin2™t1 y dx. 


Multiplying the members of (5) and (6) gives 
r a/2 
(11) | I, /2 sin?” x dr | pi / sin2*t] x dx | = 
tn G- + Zn (143) 


Multiplying the members of (10) and (11) leads to the formula 


/2 [ 1-0. vz 
(12) f sin?” x dx = |1 m Fiza 


Substituting this in (5) gives the formula 


(2n)! _ l— 0 1 
Zan In! On +1 Snr 


(13) 


Since 0 < 6, < 1, the first factor in the right member is near 1 when 7 is large 
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and is quite near 1 even when x is as small as 4or 3. Thus, even when 7 is quite 
small, the number 1/+/nzm is a very good approximation to the probability of 
finding exactly n heads and exactly n tails when 2n coins are tossed. This 
astonishing result has very significant consequences, and persons who comprehend 
it may say that mathematics was never so beautiful before, and mathematics 
really is the Queen of the Sciences. 

7 Derive the formula 


az(a sin bx — b b 
(1) fe sin bx dx = ete sin be — B cos br) +c 


in the following way. Set 


u = eF, do = sin bx dx 
to obtain 
(2) J e°? sin bx dx = — Aost + 7 f e°? cos bx dx. 
Then set 

u = sin bx, do = e dx 
to obtain 

GZ Q3 b 

(3) f etz sin bx dx = om sin 0 -z J e°? cos bx dx. 


Then combine (2) and (3) to obtain (1). Now derive (1) in the following way. 
Let J denote the left members of (1), (2), and (3). Set 


u = 6, dv = cos bx dx 
to put (2) in the form 
e cos bx afe% sinbx a 
9 pm EL GI 


Then solve (4) for J and obtain (1). 
8 Derive the formula 


e77(b sin bx + a cos bx) 


f e17 cos bx dx = z F b +c. 


~9 Sketch graphs of y = e and y = log x in the same figure. Let R, be 
the region consisting of points (x,y) for which x $0 and O S y S e7. This 
unbounded region R; is said to be bounded by (or to have boundaries) the x axis, 
the y axis, and the graph of y = e7. Let R, be the region consisting of points 
(x,y) for which x = 0 and y S 0 together with points (x,y) for which O < x S 1 
and log x Sy $0. This region Rz is said to be bounded by the x axis, the y 
axis, and the graph of y = log x. Observe that R, and Rz are congruent regions. 
Use the fact that R, and Re possess areas |R;| and |R| for which 


0 1 
|R| = lim f e-* dx, |R| = lim Í (— log x) dx 
h—> ~ œ ho JÈ 


to show that [R,| = |R} = 1. 
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10 Evaluate the integral in 


3 
x 
p= [fe 
1 + x? 
in two different ways, and make the results agree. First, use the identity 


o% o olte] O ; x 
View Vite "Vite ~ Te 


Second, integrate by parts with 
u = x?, do = x(1 + x2) dx. 
11 With the aid of the substitution (or change of variable) V/a — x = t, 


show that 
N x*Va—-xdx =2 ine (a — 27)%2 di = Toga. 


Show that the first integral can be evaluated by integration by parts. 
12 Derive the formulas 


no] no] 
Í gtl di = leel — [ te #2 dy 
x T 
no] no] 
Í EPI? dt = gle 22 — 3g 4 3 Í 4t) dt. 
z z 


Remark: Formulas of this nature give useful information, the integrals on the 
right being small in comparison to those on the left when x exceeds 2 or 3 or 4. 

13 Many problems in pure and applied mathematics involve “best fit” or 
“best approximation” in some sense or other. We can start picking up ideas by 
observing that if X is near 1l and 


() fle) = cos x, g(x) = (1- 


T 


then the graphs of f(x) and Ag(x) over the interval —r/2 S x S w/2 look much 
alike. The graph of Ag(x) is the best fit in the sense of least squares to the graph of 
f(x) when À is chosen such that 


2) Ph E) — MG) de 
is a minimum. Show that (2) will be a minimum when 
63) a O UOR de = SI NOA) ax 


and hence when À = 30/r? = 0.967546. Remark: Sketching graphs of f(x) 
and g(x) on a rather large scale indicates that g(x) > f(x) when 0 < x < 7/2. 
In fact, we can put 


(4) F(x) = g(x) — f(*) 


and show that F(0) = F(r/2) = 0, F is increasing over the part of the interval 
0 < x < 2/2 for which sin x > (8/m)x, and F is decreasing over the remaining 
part of the interval. 


Polar, cylindrical, 
10 and spherical 


coordinates 


10.1 Geometry of coordinate systems We begin with a glimpse of a 
(or the) major reason why polar coordinates should be studied. In 
many problems involving functions defined over all or portions of &;, 
there is a line L which is particularly significant. This line L may, for 
example, be an axis of symmetry or a wire carrying an electric current 
or charge. When we want to use coordinates, we can let the line L 
be the z axis of a rectangular x, y, z coordinate system as in Figure 10.11. 
When we are interested in the cylinder consisting of points at a particu- 
lar distance po from the z axis, we can correctly describe this set as 
being the set of points having rectangular coordinates x, y, z for which 
x? +. y2 = pè. Itis, however, much simpler to take the distance p from 
the z axis to a point P to be one of the coordinates of P so that the equa- 
tion of the cylinder is simply p = po. This one coordinate p is, however, 
526 
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not enough to determine the position of a point P. It turns out to be 
convenient to determine the position of P by use of p, the angle ¢, and 
the rectangular coordinate z of Figure 10.11. The three numbers p, 
ġ, z are called cylindrical coordinates of P. 


Figure 10.11 Figure 10.12 


In many other problems involving functions defined over all or por- 
tions of £3, there is a point Po (instead of a line L) which is particularly 
significant. This point Po may, for example, be the center of a spherical 
or nonspherical earth or may be a point at which an electric charge is 
supposed to be concentrated. When we want to use coordinates, we 
can let Po be the origin O of a rectangular x, y, z coordinate system as in 
Figure 10.12. When we are interested in the sphere consisting of points 
at a particular distance ro from the origin, we can correctly describe this 
set as being the set of points having rectangular coordinates x, y, z for 
which x2 + y? + 2? = 7}. It is, however, much simpler to take the 
distance r from the origin to a point P to be one of the coordinates of P 
so that the equation of the sphere is simply r = ro. This one coordinate 
r is, however, not enough to determine the position of a point P. It 
turns out to be convenient to determine the position of P by use of r, 
the angle ¢, and the angle @ of Figure 10.12. The three numbers r, 4, 
ð are called spherical coordinates of P. 

We really should write P,(x,y,z), P.(p,¢,z), and P.(r,¢,0) to denote, 
respectively, the points having rectangular coordinates x, y, z, cylindrical 
coordinates p, ¢, z, and spherical coordinates r, ¢, 6. The coordinates 
x,y, Z, P, $, 7, 6 are numbers, and it is precarious to confuse relations among 
coordinates and functions of these coordinates by allowing P(0.7,0.6,0.5) 
and f(0.7,0.6,0.5) to have ambiguous meanings. The subscripts are 
included in Figures 10.11 and 10.12 because they should be included. 
To be reasonable about this matter, we can allow P(a,b,c) to be the point 
having cylindrical coordinates a, b, c while we are solving a problem or 
constructing a theory in which cylindrical coordinates and no other 
coordinates appear. We eliminate confusion, however, by agreeing that, 
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except in special situations where there is an explicit agreement to the 
contrary, P(a,b,c) is always the point having rectangular coordinates 
a, b, c. 

In ordinary useful applications of the cylindrical coordinates of Figure 
10.11, it is always supposed that p 2 O and it is sometimes supposed 
that p > 0 and —r <¢@ <S r. Sometimes the latter restriction on ¢ is 
removed so that ¢ can vary continuously as a particle having cylindrical 
coordinates (p,¢,z) makes excursions around the z axis. In those situa- 
tions in which it is supposed that p > 0 or p 2 0, the graph in cylindrical 
coordinates of the equation ¢ = ġo is a half-plane (not a whole plane) 
having an edge on the z axis. In those situations in which ¢ is unre- 
stricted, a point P does not determine its cylindrical coordinates because 
the two points 


P.(p, $ + Dnr, 2), P.(p, $, z) 


are identical when n is an integer. In ordinary useful applications of 
spherical coordinates, it is always supposed that r 2 O and is sometimes 
supposed that r > 0,0 <8 Sr, and —r < ġ Sr. In ordinary geo- 
graphical terms the coordinate 6, which is 0 when P.(r,¢ġ,8) is at the north 
pole and is 7/2 when P,(7,¢,6) is on the equator and is m when P,(r,¢,6) 
is at the south pole, determines the latitude of P,(7,¢,0). The coordinate 
¢@ determines the longitude. 

Partly because the endeavor helps us to understand cylindrical and 
spherical coordinates, we turn to the study of polar coordinates of points 
in a plane. The basic idea behind the concept of polar coordinates is 
both simple and attractive. Suppose we are located at a point O, an 
origin or pole, in a plane and we wish to give explicit instructions telling 
how to make a pilgrimage to a point P in the same plane. We begin by 


p constructing a half-line O4 with an end at O as in 


p Figure 10.13 and calling this half-line the initial line 

$+r from which angles are to be measured. In case P is 
$ not the origin, instructions for reaching P are now 

O A easily given. Start at the origin looking in the direc- 
Figure 10.13 tion of the initial line, turn in the positive (counter- 


clockwise) direction through the angle ¢ until facing P, 
and then travel the appropriate distance p from O to P. We could (and 
sometimes do) end the matter here and say that p and ¢ are the polar 
coordinates of P,(p,¢), the point having polar coordinates p and ¢. 
Sometimes we restrict ¢ to the domain —x < ¢ < r and end the matter 
in another way. While recognition of the fact is sometimes irksome, it is 
nevertheless true that if n is an integer which may be negative as well 
as positive or zero and if we turn through the angle 2mm + ¢, then we 
will be facing toward P and can travel the distance p to reach P. When 
we take this possibility into account, we find that, for each » = 0, £1, 
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+2, °° , the numbers p, ¢ + 2m constitute a set of polar coordinates 
of P. We could (and sometimes do) end the matter here. It is not 
always easy to know when we are being wise, but we can recognize one 
more possibility. After turning through the angle ¢ +7 or ¢ —7 or 
@ + (2n + 1)x, where n is an integer, we will be facing away from P 
and we can reach P by going backwards a distance p. When we take 
this last possibility into account, we find that, for each n = 0, +1, +2, 
- , the numbers (— p, ¢ + r + 2nm) constitute a set of polar coordi- 
nates of P. When polar coordinates of this variety are permitted to 
appear in our work, we abandon the idea that p is a distance and take p 
to be a coordinate that can be negative. Thus when p <0, the point 
P,(p,¢) having polar coordinates p, ¢ is the same as the point P,(\pl, 
@ + rt) having the more normal polar coordinates lol, +r. We still 
have to consider the polar coordinates of P when P is the origin O. It 
turns out to be best to agree that, for each num- 
ber ¢, the numbers O and ¢ are polar coordinates 
of the origin.f 
Let polar and rectangular coordinate systems be 
superimposed in such a way that, as in Figure 10.14, 
the initial line of the former coincides with the non- 
negative x axis of the latter. When P is a point Figure 10.14 
different from O, it is easy to obtain formulas 
relating the rectangular coordinates (x,y) of P and each set (p,#) of polar 
coordinates of P for which p > 0. The definitions 


cos ¢ = =, sin @ = 


DIR 
oD j 


of the trigonometric functions give the formulas 
(10.141) x = pcos ¢, y = p sin ¢ 


which uniquely determine x and y in terms of p and ¢. On the other 
hand, the formulas 


_ _ x . _ Yy 
004D p= vty C8 = zaya NO = ery 


uniquely determine p in terms of x and y and uniquely determine an 
angle o such that —r < ġo S r and ¢ must have the form ġo + 2nz, 
where n is an integer. In case p> 0 and —r/2 < < 2/2, the last 


t In this chapter, the coordinates p, ¢, 7, 0 have the classical significance they usually 
have in mathematical physics and elsewhere when Legendre polynomials and such things 
appear. Some textbooks use r and 6 for polar coordinates and, with a shift in meaning of 
coordinates, use 7, 6, ¢ for spherical coordinates as we do. Sometimes p is used for a spheri- 
calcoordinate. Persons who stray from one book or one classroom to another do not always 
appreciate modifications of classical notation, but they are rarely if ever seriously injured. 
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two of the formulas can be replaced by the single formula ¢ = tan`! y/x. 
These formulas enable us to make substitutions which transform formulas 
involving coordinates of one brand into formulas involving coordinates 
of the other brand. 

The remainder of the text of this section is devoted to development of 
the art of sketching polar coordinate graphs of given equations for which 
it is more or less appropriate to operate without restricting p to non- 
negative values and without restricting ¢ to an interval such as the 
interval —r < ġ Sr. In this situation, the relation between polar 
graphs and equations is complicated by the fact that each point has an 
infinite set of polar coordinates and we need a definition. The polar 
coordinate graph of an equation of the form f(p,¢) = O is the set S con- 
sisting of points P each of which has at least one set (p,) of polar coordi- 
nates for which f(p,¢) = 0. The polar graph of the equation p = —] 
is then the unit circle C with center at the origin because P,(—1,¢) is, 
for each ¢, the same as the point P,(1, ¢ + r). The polar graphs of the 
two equations p = —l and p = 1 therefore coincide, 
even though there is clearly no single pair (p,¢) of 
numbers for which the two equations are simultane- 
ously satisfied. Figure 10.143 can promote under- 
standing of this matter. When we allow p to be 
negative, the polar graph of the equation ¢ = 0 is 
more than the initial half-line of the polar coordinate 
system; it is the entire line upon which the initial 
half-line lies. 

Supposing that a is a given positive constant, we undertake to deter- 
mine the nature of the polar graph of the equation 


(10.15) p = asin 2¢ 


Figure 10.143 


without laboriously locating many points. Observing that |p| Sa 
always and |p| = a sometimes, we draw the circle of radius a with center 
at the origin to help us. Our knowledge of the sine tells us that sin 2¢ 
increases from 0 to 1 and p increases from 0 to a as 2¢ increases from 0 
to x/2 and hence as ¢ increases from 0 tox/4. This information enables 
us to sketch the first part of the first leaf, or loop, of Figure 10.151. 
Similarly, p decreases from a to 0 as 2¢ increases from 2/2 to m and hence 
as @ increases from 7/4 to x/2. Now we complete the first leaf. Con- 
tinuing to decrease, p decreases from 0 to —a as 2¢ increases from r to 
34/2 and hence as ¢ increases from w/2 to 34/4. During this operation, 
the terminal side of the angle ¢ is in the second quadrant and negativeness 
of p throws the graph into the fourth quadrant to give the first half of 
the second leaf. Then p increases from —a to 0 as 2¢ increases from 
3x/2 to 2m and hence as ¢ increases from 3x/4 to x. This gives the 
second half of the second leaf. Continuing its increase, p increases from 
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0 to a as 2¢ increases from 27 to 57/2 and hence as ¢ increases from r to 
Sr/+. This gives the first half of the third leaf. Three more increases 
in 2¢ and ¢ complete the third and fourth leaves as @ increases to 2r. 
Increasing @ beyond 2r yields more curve but no more graph, since the 
graph is retraced. The full graph is shown in Figure 10.151. In the 
good old days, perhaps before clover was invented and when roses were 
primitive, someone called this graph the rose with four leaves. 


= 
2 


Figure 10.151 Figure 10.152 


The polar graph of the equation 
(10.16) p = a(l + cos ¢) 


is obtained much more easily. As @¢ increases from 0 to r and then to 
2r, p decreases from 2a to O and then increases to 2a. ‘This graph, which 
is called a cardioid, is shown in Figure 10.152. 

Let a be a positive constant. When 0 < ¢ < 72/2, it follows from 
elementary geometry and trigonometry that the point P having polar 
coordinates (p,¢) for which 


(10.161) p = acos @¢ 


lies on the circle of Figure 10.162. Consideration of other angles shows 
that the circle is the complete graph (which, of course, 
means the graph) of the equation. 


A bit of novelty appears when we undertake to a 
sketch a graph of the equation SEX 
(10.17) Z 


p? = a* cos 2¢. rex 
As 2¢ increases from —7/2 to 0 and then to 7/2, and ay 
hence as ¢ increases from —2/4 to 0 and then to 


Figure 10.162 
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a/4, a? cos 2¢ increases from 0 to a? 

and then decreases to 0. ‘This infor- 

mation enables us to sketch the two 

loops of Figure 10.171, p being posi- 

tive on one loop and negative on the 

Figure 10.171 other. As 2¢ increases from 7/2 to 

3/2, and hence as ¢ increases from 

1/4 to 3r/4, cos 2¢ is negative and no values of p are obtained. Further 

investigation shows that the graph already drawn is complete. It is 
called a lemntscate. 

When a > 0, the polar graph of the equation p = a¢ is called a spiral 

of Archimedes. This graph is shown in Fig. 10.181, the part for which 


Figure 10.181 Figure 10.182 


@ <0 being dotted. When spirals are being graphed, and at some 
other times, the approximations t = 3.1416, 7/2 = 1.5708, 7/4 = 0.7854, 
and similar others are used. It is very often necessary to know rela- 
tions akin to the relations 2r radians = 360°, m radians = 180°, 2/2 
radians = 90°, and 7/4 radians = 45°. It is sometimes useful to know 
that 1 radian is 180/r degrees or about 57 degrees, but degrees and min- 
utes and seconds play minor roles in our work. The polar graph of the 
equation p = ¿° is an exponential spiral which is commonly called a 
logarithmic spiral. The graph is shown in Figure 10.182. When a > 0, 
the dotted part for which ¢ < 0 spirals inward around the origin. 


Problems 10.19 


1 With the aid of Figure 10.11, show that the formulas giving the rectangular 
coordinates x, y, z of a point in terms of the cylindrical coordinates p, $, z of the 
same point are 


x = pcos œ, y = psin ¢, Z=% 
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2 With the aid of Figure 10.12, show that the formulas giving the cylindrical 


coordinates p, $, z of a point in terms of the spherical coordinates 7, $, 6 of the 
same point are 


p =rsin 0, $ = ọ, % = r cos l. 


3 With the aid of Problems 1 and 2, show that the formulas giving the 


rectangular coordinates of a point in terms of the spherical coordinates of the 
same point are 


x = r cos ¢ sin 6, = r sin @ sin 6, % = r cos l. 


4 Transform the following equations from rectangular to polar coordinates 


(a) at +y? = a" 


Ans: p=a 
(b) x =a Ans: pcos @ =a 
(c) (cos a)x + (sin a)y = a Ans.: pcos (¢ ~ a) =a 
(d) xy =1 Ans.: p? sin 2¢ = 2 
(e) x? — y? = a? Ans.: p? cos 2¢ = a? 


5 In Problem 31 of Section 6.4, we said that the rectangular graph of the 
equation 


(a? + y? = è — yt) 


is a lemniscate. Show that the polar equation is p? = a? cos 2ọ. The graph 
appears in Figure 10.171. 

6 Lemniscates have a simple geometric property. Let b be a positive num- 
ber and let F, and F; be the points (sometimes called foci) having the rectangular 
coordinates (—b,0) and (2,0). Let S be the set of points P for which 


(1) IFP] [F2P| = 2°. 
Show that the rectangular equation of S can be put in the form 
(2) (x? + y?) = 2b7(x? — y?) 


and hence that Sis alemniscate. Show, in one of the various possible ways, that 
the polar equation of this lemniscate is 


(3) p? = 2b? cos 2¢. 


7 Transform the following equations from polar to rectangular coordinates. 


(a) p=3 Ans.: x2 + y =9 
(b) p?= a? sin 2¢ Ans.: (x? + y?)? = 2a*xy 
(c) p = a cos $ Ans: x? + y? = ax 
(d) p = 2a(1 — cos 9) Ans.: (x + y? + 2ax)? = 4a?(x? + y?) 


8 Show that when 
p = 2a cos ¢, 


the point with polar coordinates (p,@) runs once in the positive direction around 
a curve C as ¢ increases from —7/2 tom/2. Show, in one way or another or in 
more than one way, that C is the circle of radius a having its center at the point 
with rectangular coordinates (2,0) 
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9 Sketch polar graphs of the equations 


(a) p = 3 + 2 cos ọ (b) p = 3+ £ cos > 

(c) p = a sec $ (d) p = tan ¢ 

(e) p = a sin 3¢ G) p=asin$ 

(z) p = 1/¢,(¢$ > 0) (k) p = 1/(1 + $°), ($ > 0) 


10 Sketch rectangular graphs of y = vV cos x, y= cos x, and y = cos? x in 
one figure, and then sketch polar graphs of p = v cos ¢, p = cos ġ, and p = 
cos? @ in another figure. Partial solution: Good polar graphs of p = cos ¢ and 
p = </cos ġ and the right-hand half of the graph of p = cos? $ are shown in 
Figure 10.191. 


SO 
S 


No- LN - 
~90° -80° -70° -60° ~ 50° 
Figure 10.191 


11 Sketch rectangular graphs of x? + y? = 1 and x? + y? = 25 in one figure 
and then sketch polar graphs of p? + $? = 1 and p? + œ? = 25 in another figure. 

12 In case pı > O and p2> 0, a straightforward application of the law of 
cosines gives the formula 


d? = pi + pz — 2pipe cos ($: — Qı) 


for the square of the distance d between points having polar coordinates (p1,¢1) 
and (p2,¢2). Show that the formula is also valid when pı £ 0 or p2 < 0 or both. 

13 This is a rather heroic problem that requires careful and accurate applica- 
tions of rules for differentiation and attention to algebraic details. No vectors 
or figures or tricks of any kind are to be used; just differentiate and substitute. 
Using the formulas 


x(t) = p(t) cos (t), y(t) = p(t) sin (t) 
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and the formula (7.389) for curvature in rectangular coordinates, show that if 
p and ¢ are functions having two continuous derivatives and if the point P(t) 


having polar coordinates p(t), $(¢) traverses the curve C as ¢ increases, then the 
curvature K of C at P(t) is 


— PEPE I? + eO OAA — PO ep + 2[0’()]?6'@) 
o (e + [p(¢)]?} 


provided the denominator is not zero. 


14 Show that letting #(t) = ż and replacing t by ¢ in the last formula of the 
preceding problem gives the much simpler formula 


g = PC) + 2['(4)]*? — (4)p”(¢) 
(Lo()]? + le$) 


for the curvature of the graph of the polar equation p = p(@) oriented in the 
direction of increasing @¢. 
15 The two equations 


(1) Vix? +y? + [al + yl = a 
(2) Vii + prety =a, 


in which a is a given positive constant, have respective graphs G, andG, Show 
that the polar coordinate equations of these 
graphs are 


_ a 
9 P=IFiin $] + [eos gl 


and 

| _ a 

(4) P =I Ẹsin 6 + cos @ 
or 


a 


"1+ Visin(6 +7) 


with p > 0. Obtain more or less complete 
information about G, and Go». 

16 The cissoid of Diocles is the set of 
points P(x,y) obtained in the following way. 
Leta >0Q. Asin Figure 10.192, let C be the 
circle with center at (4,0) having radius a. 
LetO0 < x < 2a. Let Q; and Q: be points on 
the upper part of C having x coordinates x 
and 2a — x. Then P(x,y) is the intersection 
of the line OQ. and the vertical line through Figure 10.192 
Qı. Letting @ be the angle which OQ, 
makes with the x axis and with the line Q.Q,, we see that y = a when x =a 
and, when x =Æ a, 


(1) tang=2= 1Q.PI Iove- e-a 
x 


[Q-Q] 2(x — a) 
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This gives 
(2) (2a —x)y = x Vx(2a — x), 


and squaring gives 

3 
(3 2= _$* _. 
\ ) y 2a — x 
The graph of this equation, including points for which x = a, x = 0, and y < 0, 
is the cissoid. We now come to the polar coordinate problem. With the aid of 
the fact that Qz has polar coordinates (:,@) for which pı = 2a cos @¢, try to use 
Figure 10.192 to derive the polar equation of the cissoid. If unsuccessful, use 
(2) and formulas which give x and y in terms of p and ¢. Ans.: 


(4) p = 2a sin ¢ tan @. 


Remark: Diocles employed the cissoid to “duplicate a cube,” the problem being 
to start with some line segment of length x (the length of an edge of a particular 
cube) and construct a segment of length -//2 x (the length of the edge of a cube 
having double the volume of the original one). The line L through the points 
(a,2a) and (2a,0) has the equation (2a — x) = y/2 and intersects the cissoid at 
a point (x,y) for which y3 = 2x3 and hence y = y2 x. What Diocles really 
wanted to do was duplicate a cube by ruler-and-compass construction. This 
has been proved to be impossible. It is possible to construct the line L with a 
ruler and compass, and it is possible to construct points on the cissoid one by 
one with a ruler and compass. The reason why Diocles failed to accomplish 
his purpose should be explained. Life is too short to enable us to produce ruler- 
and-compass constructions of all of the points on the cissoid, and there is no way 
to prescribe rules for a ruler-and-compass construction of the particular point 
where L intersects the cissoid. 

17 The conchoids of Nicomedes provide a method (but not a ruler-and-com- 
pass method, because no such method exists) for trisecting angles. Let p and g 
be given positive numbers. Let O (the pole of the conchoid) be the origin and 
let L (the directrix of the conchoid) be the line having the rectangular equation 
x = p and the polar equation p cos @ = p or p = p sec @ as in Figure 10.193. 
The conchoid consists of two parts or branches. When —r/2 < œ < 7/2, the 
line OM of the figure meets the line L at the point M and meets the far branch 
(the branch most remote from the pole) at a point P whose distance from M is 


q and whose distance from the origin is p sec 6 + g. The polar equation of the 
far branch is therefore 


(1) p = psec ọ +g. 


The same line OM meets the near branch (the branch nearer to the pole) at a 
point P’ whose distance from M is g and whose polar equation is 


(2) p = p sec $ — q. 
The equation 
(3) p=psecotg 


or the equation 


(4) (p — p sec $}? = q? 
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is the polar equation of the conchoid The graph consisting of the two solid 
branches nearest the line L is a conchoid for which qg < p. The dotted graph 
consisting of the two outer branches is a conchoid for which g > p. It can be 
observed that finding the polar equation of the conchoid was no problem; the 
equation came as the conchoid was being defined. Now comes the problem. 
By direct use of Figure 10.193 or, alternatively, by using (4) and transformation 


l 
I 
Conchoids H 
1 / 
i i 
Í / 
l / 
Figure 10.193 Figure 10.194 


formulas, find the rectangular equation of the conchoid which applies to the 
primed coordinate system for which the x’ and y’ axes are the x axis and the 
line having the equation x = p. Ans.: 


(5) xy’? = (g? — x) + p)?. 


Remark: Conchoids are interesting examples of graphs of quartic equations, that 
is, equations of the form f(x,y) = g(x,y), where f and g are polynomials in x and 
y one of which has degree 4 and the other of which has degree not exceeding 4. 
To trisect the given angle 4OP of Figure 10.194 with the aid of a conchoid, let 
M be the point at which the line OP intersects the line Z, and let a = IOM |. 
Let C be the far branch of the particular conchoid for which g = 2a. Let P; 
be the point at which the horizontal line through M intersects C and let M, be 
the point at which the line OP; intersects the line L so that |M,P;| = 2a. To 
begin our attack upon angles, let @ be the angle 4OP, and let 6 be the angle 
P,OP, so that the given angle 0P is $6 + 6. Applying the law of sines to the 
triangle OMP, gives the first of the equations 


a _ |MP,|| a _ 2acos ọ 
sin @ sinl sin $ sin 6 


and the second follows because |MP,| = 2a cos ġ. Thus 


sin 0 = 2 sin @ cos @ = sin 24, 
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so 6 = 2¢ and the given angle AOP is 3m. ‘Thus the line OP, trisects the given 
angle. It is possible to use ruler-and-compass constructions to locate many 
points on the conchoid C, but it is impossible to give explicit instructions for 
producing the particular point P; needed for trisection of the given angle. 

18 Let O bea point on a circle of diameter a, and let b be a positive number. 
A set S of points P (a limagon of Pascal) is determined in the following way. 
If Z is a line through O and if Q is either the second point in which Z intersects 
the circle or is O itself if Z is tangent to the circle, then S contains the two points 
on L at distance b from Q. Sketch some figures and investigate these limaçons. 
Remark: With suitable coordinates, the equation can be put in the forms 


p =a cos l t b, p = a cos ĝ + b, p =b — a cos @. 


The fact that cos (0 + r) = — cos @ is important. 

19 Let a and b be positive constants. Let Fı and F be two points having 
polar coordinates (a,r) and (a,0) and rectangular coordinates (—a,0) and (a,0). 
The set S of points for which |F,P| |F,P| = b? is called an oval of Cassini. Investi- 
gate these ovals. Remark: If b > a (read “if b is much greater than a”), then 
S is closely approximated by a large circle. If b = a, then S is a “figure eight” 
which is, in fact, a lemniscate; see Problem 6. Ifb <a (read “‘if b is much smaller 
than a”), then S consists of two small ovals that are closely approximated by 
small circles. 


10.2 Polar curves, tangents, and lengths As our discussion of 
coordinate systems may have indicated, polar coordinates can be par- 
ticularly useful in situations where dis- 
tances from an origin are particularly 
significant. It turns out that the polar 
equation of a conic is exceptionally neat 
and attractive when we put the conic in 
the “standard position.” As in Figure 
10.21, let the focus and directrix of a conic 
K having eccentricity e be placed upon a 
polar coordinate system in such a way that 
Figure 10.21 a focus is at the origin and the directrix is 

perpendicular to the initial line and inter- 
sects the extended initial line at the point having polar coordinates 
(~,x). The intrinsic equation of the conic K, which first appeared in 
our work in (6.23), is then 


(10.22) IFP] = e|PDI. 


While it can be presumed that we know something about conics and 
can proceed without the result, it is nevertheless interesting to use the 
intermediate-value theorem to prove that if m/2 < |¢o| Sz, then 
there iS exactly one point Po with polar coordinates (po,¢0) for which 
[FPo| = e|DoPol. In any case, we consider only values of p for which 
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p > Q, so there can be no possible objection to use of Figure 10.21. The 
intrinsic equation (10.22) then gives 


p = e(p + pcos ¢). 


When cos ¢ < 1/e, we can solve for p to obtain the important “‘standard 
form” equation 


(10.23) p ep 


l — e cos ọ 


To illustrate the fact that polar equations are used as sources of infor- 
mation, we proceed to study (10.23). When e <1, the condition 
cos ¢ < 1/e is satisfied for each @ and the polar graph of (10.23) is an 
ellipse. In this case the point P with polar coordinates (p,¢) runs 
repeatedly around the ellipse in the positive direction as ¢ increases over 
intervals of length 27. In case e = 1, the condition cos ¢ < 1 means 
that ¢ is not an integer mutiple of 27, and the polar graph of (10.23) is a 
parabola. In this case P,(p,¢), the point having polar coordinates p 
and ¢, runs in the positive direction along arcs of the parabola as ¢ 
increases over subintervals of the interval 0 < ¢ < 2r. In case e > 1, 
the restriction cos ¢ < 1/e is more severe, and the polar graph of (10.23) 
is one branch of a hyperbola. In this case, we confine ¢ to the interval 
do < ¢ < 2r — go, where do is the first-quadrant angle for which 
cos ġo = l/e. The point P,(p,g) runs over arcs of the hyperbola as ¢ 
increases over subintervals of the interval ġo < ¢ < 2r — go. Because 
particles moving along one branch of a hyperbola do not suddenly jump 
to another branch, and for other reasons, we are usually content to work 
with only one branch of a hyperbola. We are, therefore, usually not 
interested in the fact that if ¢ is an angle for which |¢| < ¢o so that 
e cos ¢ > 1, then the formula (10.23) determines a negative number p 
and the point P,(p,¢) lies on the other branch of the hyperbola. As 
Figure 10.21 indicates, each conic K intersects the axis of K at a vertex 
V between the focus and directrix. Putting ¢ = r in (10.23) shows that 
the distance from F to V is ep/(1 + e). In case e = 1, this reduces, as 
it should, to p/2. Incase e = 1, another vertex Ñ” is obtained by setting 
@ =0. The polar coordinates of V’ are ep/(1 — e) and 0. In case 
e < 1l, our formulas give 


ep eh _ p, 


010.24) |VP"| = PFI + EP = I + Te 


The center of the conic lies midway between the vertices, and it is a 
straightforward matter to continue this investigation to obtain additional 
information. 

In connection with a conic or other curve C having a manageable polar 
equation, it is of interest to have information about the tangent to C 
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at a point P on it and, in particular, to have information about the angle 
y (psi) between this tangent and the vector running from the origin to 
P. The most informative way to attack these and related questions is 
by use of vectors; in fact it is not improbable that, in the long run, experi- 
ence gained by working with vectors may be more valuable than informa- 
tion about y. We may start with a curve having the polar equation 
p = f(¢), where f is supposed to have a continuous derivative. We 
may suppose that a particle P moves along C in such a way that its 
polar coordinates at time ¢ are f(¢(t)) and (t). When this is so, we 
can set p(t) = f((t)) and say that P has polar coordinates p(t) and ¢(t) 
at time t. We now free ourselves from the supposition that p was a 
function of ¢ in the first place, and we consider the general situation in 
which a particle P has polar coordinates p(t) and ¢(¢) at time t£. When- 
ever we wish to do so, we can reduce our work to the special case simply 
by setting $(¢) = t, but it is very much worthwhile to handle the more 
general situation. Moreover, we can still further increase the appli- 
cability of our work by studying a still more general situation. 

One who needs the medicine can free himself from the notion that 
matters have become mysterious by supposing that P is a bumblebee or 
electron that is buzzing around in £3 in such a way that its cylindrical 
coordinates at time ¢ are p(t), (t), z(t). One who wishes to consider 
only polar coordinates can put z(ż) = 0 at all times. The rectangular 
coordinates x(t), y(t), z(t) are determined in terms of the cylindrical 
coordinates by the formulas 


(10.25) x( = p(t) cos (t), = v4) = p(t) sin ġ(t),  z@) = 22). 


Thus, in terms of the standard unit vectors i, j, K, the vector r(¢) run- 
ning from the origin to P at time ż is 


(10.26) r(t) = p(t)[cos ¢(t)i + sin ¢(j] + z()K. 


In all of the following work, we suppose that ¢ is confined to an interval 
over which p, ¢, and z are functions having continuous derivatives. 
Let C be the curve (or arc) traversed by P as t increases over this interval. 
Differentiating (10.26) gives the formula 


(10.261) v(t) = p’(2)[cos o(2)i + sin ¢(Hj] 
+ p(t)¢’(2)[— sin olti + cos ¢(4)j] + 2’@k 


for the vector v(t) which is the velocity of P at time ¢ and is also the 
forward tangent to C at P. This can be put in the form 


(10.262) V(t) = pP (Ault) + p(t)’ (tut) + z (e)k 
where 

ui(t) = cos ¢(2)i + sin o(4)j 
(10.263) | u(t) = — sin ¢(¢)i + cos ¢(t)j. 
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The vector U(t) is the unit vector in the direction of the projection of 
the vector r(t) upon the xy plane. The vector u,(t) is easily seen to be 
another unit vector, and it is orthogonal to both u(t) and k because 
Ui(t)-U2(t) = 0, k-u,(t) = 0. 
Moreover, as we see by introducing vector products, 
i j k 
u(t) X u(t) = cos ¢(t) sin g(t) 0| =k, 
— sin ¢(t) cos d(t) 0 


so the three vectors u,(ż), U2(t), K, in this order, constitute a right- 
handed orthonormal system of vectors. Figure 10.27 shows these things 


x 
Figure 10.27 


for the special case in which C, and hence the vector r(t), lies in the xy 
plane of the paper and the unit vector k therefore extends vertically 
upward from the plane of the paper. Use of the right-hand finger and 
thumb rule shows that Figure 10.27 would be wrong if the direction of 
the vector u(t) were reversed, and this shows that our excursion into 
E; helps us to see how things are oriented in the plane. 

Thus, (10.261) and (10.262) are remarkably simple and informative 
formulas which display the scalar and vector components of v(t), the 
velocity vector or the forward tangent vector, in terms of the three 
orthonormal vectors u(t), W2(t), and k. The orthonormality of these 
vectors enables us to use (10.26) and (10.261) to obtain the formulas 


(10.271) rOl = vle + [z6] 

(10.272) wO = VOR + OLH + [e’@P 

(10.273) r(t)-v(t) = p(t)p’(t) + 2(4)2’(2). 

The angle y between the vectors r(t) and v(t) can, when |r(z)| = 0 and 
|v(£)| = 0, be calculated from the basic formula 


(10.274) r(t)-v(t) = rO| vE] cos ¥. 
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For the case in which z(t) is identically zero and p(t) > 0, this gives the 
formula 


10.275 = a OT 
( ) cos p TOE + [p(t) p (4) ]? 


In case p(t) > 0, ¢’(t) > 0, and p’(t) # 0, this enables us to prove the 
first formula in 


— PHS G) ~ pf. 

(10.276) tany = uC tan y =p dp ~ dp 
dg 

In appropriate circumstances, the second formula follows from the first. 


In accordance with conclusions reached at the end of Section 7.2, we 
can put (10.272) in the form 


(10.277) s(t) = VOR + Ep OP + e), 


where s(t) is the coordinate at time ¢ which is obtained by measuring 
distance along the curve C. When z(t) = O for each ¢ and ¢(t) = t so 
¢ (t) = 1, this is often put in the form 


2 
(10.278) 2 = lp + (35) . 


Matters relating to lengths of curves are of sufficient interest to 
justify close scrutiny of the following theorem. 


Theorem 10.28 If pand ¢$ are functions having continuous derivatives 
over a St S b, then the integral in the formula 


b 
(10.281) L= f VP Or + POS OF at 
1s the length of the curve C consisting of the ordered set of points P having 
polar coordinates (p(t), o(t)) for whicha St S b. 
The simplest proof of this theorem is obtained by setting 


(10.282) x(£) = p(t) cos e(t), = y(t) = p(t) sind), z) = 0 


in the formula (7.26) which was thoroughly discussed and proved in 
Section 7.2. The formulas 


(10.283) x’(t) = —p(t) sin o(t)¢’(t) + cos (ÐP (2) 
(10.284) y(t) = p(t) cos (p (A + sin $(t)p’(t) 
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enable us to convert (7.26) into (10.281). For the special case in which 
g(t) = tand ¢’(t) = 1, it is standard practice to put (10.281) in the form 


B dp 2 
(10.285) L=] Je + (35) dé 


in which the variable of integration is @ and the limits of integration are 
called a and 8 (instead of a and b) because a and 8 “look more like angles.” 

It is worthwhile to know a little trick by which the above formulas 
involving polar coordinates can be remembered. We can look at Figure 
10.286 which shows, among other things, an arc of length As joining two 


Figure 10.286 


points P and Q which have polar coordinates p, ¢ and p + Ap, ¢ + Ad. 
We can feel that the outer part of the figure resembles a rectangle enough 
to enable us to write the approximate formulas 


| -a/p hg? T Lp -— AP _ 
(10.287) As= Vp Ag? +A, cosy = ae 


and expect that correct results should be obtained by dividing by At 
or by A¢ and taking limits. We can know that this optimism does not 
prove formulas, but it can help us to recall the formulas when we have 
forgotten them. When we wish to calculate the length L of the curve 
having the polar equation p = f(¢) with a S ọ S $, we can, when f is 
continuous, sketch a figure more or less like Figure 10.286 and use the 
optimistic calculation 


(10.288) L = lim X As = lim ` ~V p? Ad? + Ap? 


2 B 2 
= iim) fet + (32) Ad = i qe? + A dé 


to lead us to the correct formula (10.285). 
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Problems 10.29 


1 Obtain the standard polar equation of the conic K and use it to sketch 
the major and minor (or conjugate) axis of K when the eccentricity e and dis- 
tance p from the focus to the directrix are 


(a) ¢=3,p =6 (b) e =2,p = 3 
2 Make a hasty sketch of the cardioid having the polar equation 
p = 1 + cos @. 


Show that tan y = —(1 + cos ¢)/sin @ = — cot ġġ. Calculate tan Ų when 
$ = 7/2 and when ġ = m and make any repairs in your figure that this informa- 
tion may require. 

3 Find the length of the cardioid of the preceding problem. Ans.: 8. 

4 There is something unique about angles at which radial lines from the 
origin intersect the exponential spiral having the polar equation p = °. What 
isit? Ans.: The angles are all equal. 

5 As @ increases from 0, the point Pp(p,@) on the polar graph of p = e> 
spirals around the origin. Since r is about 3 and e’ is about 20, the point P 
spirals toward the origin so rapidly that the length of the whole path may be not 
much greater than the distance from the starting point to the origin. What are 
the facts? Ans.: To try to preserve good ideas and perhaps create more, put the 
matter this way: If @ starts at time ¢ = 0 and increases at a constant rate, the 
point P must keep moving forever, but its speed decreases so rapidly that the 
total distance traveled is always less than ~Z and only approaches +/2 as t — œ 
and ġ — œ. It makes sense to say that the total length of the path is +/2. 

6 Let C be the polar graph of p = f(ġ), where f(0) = 1, f(2r) = 2, and f is 
continuous and monotone increasing over the interval OS @ < 2r. Try to 
decide whether it is easy or difficult or impossible to prove that C must have 
finite length. 

7 The curve C of the preceding problem lies between the polar graphs of the 
equations p = 1 and p = 2. Try to decide whether it is easy or difficult or 
impossible to prove that the length of C lies between the lengths of the inner and 
outer circles. 

8 Let the displacement vector of a particle P at time t be 


(1) r(t) = p(t)[cos P(t)i + sin ¢(t)j), 


where it is supposed that p and @ have two derivatives. Forgetting formulas 
which we have derived but remembering rules for differentiating products, derive 
the velocity and acceleration formulas 


(2) v@) = p’()[cos p&i + sin (il) + p(e)o’@I[— sin lei + cos ¢()i] 
(3) a) = [o"t — plo E) llos $¢)i + sin $¢)j] 
+i” E) + 20’) $’@)IL— sin (i + cos $()j]. 


Observe anew that the two vectors 


(4) [cos (i + sin P(#)j], [— sin d(¢)i + cos d(2)j] 
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are orthonormal vectors because they are unit vectors and their scalar (or dot) 
product is zero. Henceforth we consider only time intervals over which p(z) > 0. 
When motion of the particle P is being considered, the first vector in (4) is said 
to be radial because it has the direction of the “radius vector” from the origin 
to P, and the second vector is transverse because it is orthogonal to the radius 
vector. Thus the right member of (3) displays, in order, the radial and the 
transverse vector components of the acceleration of P. This acceleration is said 
to be radial (the kind produced by a “central force field” having its center at 
the origin) when its transverse component is zero, that is, 


(5) p(t) @" (2) + 2p’) 6’(t) = 


This is another one of those derivative equations that is called a differential 
equation and from which information can be extracted. Since p(z) > 0, the 
left member of (5) is zero if and only if the product of it and p(t) is zero, that is, 


(6) OPE A + 2p(z)p’(t) o's) = 0. 
The virtue of (6) lies in the fact that it can be put in the form 


(7) = {Ole} = 0, 


and this fact should be carefully checked. The virtue of (7) lies in the fact that 
it holds over an interval of values of ż if and only if there is a constant ¢ such that 


(8) 2lp(s)l?$"(t) = c 


for each ¢ in the interval. The physical significance of (8) will be revealed in 
Section 10.3; it is an important fact that (8) holds if and only if the radius vector 
from the origin to the particle P sweeps over regions of equal area in time intervals 
of equal lengths. 

9 A circular race track has cylindrical equations p = a and z = 0, and it 
has rectangular equations x? + y? = a? and z = 0. A wheel of radius b rolls, 
without slipping, around the track. The center of the wheel is always above the 
track, it travels with constant speed, and it makes a circuit of the track in T 
minutes. At time ¢ = 0 the center of the wheel is going in the direction of the 
positive y axis, and a pink tack P in the tire on the wheel is at the point having 
rectangular coordinates (a,0,0). Letting r,(¢) denote the vector running from 
the origin O to the center Q of the wheel at time #, show that 


(1) rı) =a (cos = i + sin 27 i) + bk. 


Letting x(z), y(t), z(t) denote the rectangular coordinates of the pink tack P at 
time ż, show that 
2rat\ 


(2) z(t) =b (1 — cos Far 


Using the fact that the line PQ (a spoke of the wheel) is perpendicular to the 
horizontal line from Q to the z axis, obtain the formula 


(3) x(t) cos am + y(t) sin a =a 
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which may help us determine x(t) and y(¢). Using the fact that the distance 
from P to Q is always b (the radius of the wheel), obtain the formula 


2 2 2 
(4) [z0 — a cos m] + [xo — asin ad = b? sin? TF 
Show that 
(5) [x()] + [y(i] = a? + b? sin? ma, 


With or without assistance from these formulas, derive the formulas 


Irt . 2rat 


x(t) = a cos“ + b sin sin Te 


_ art ; 2at . 2rat 

y(t) = a sin y — b cos Fr sin Far 
2rat 
z(t) = b — bcos =r FT 


and use them to find the velocity and acceleration of P when ż = 0. Remark: 
Mechanisms involving rolling wheels (or gears) appear in machinery in various 
ways, and we have the preliminary idea that we can start studying these things. 

10 Let abea positive number. The point P lies on a line through the origin 
which intersects the line having the rectangular equation x =a at a point Q, 
and IPO] i is equal to the distance from Q to the x axis. The set of such points P 
is a strophoid. Find the polar equation of the strophoid. Ans.: 


p? cos ¢ — 2ap + a? cos ọ = 0. 


II Supposing that a > 0, prove that the line having the rectangular equation 
y = ais an asymptote of the hyperbolic spiral having the polar equation pẹ = a. 

12 Supposing that a > 0, prove that the x axis is an asymptote of the lituus 
having the polar equation p*¢ = 

13 Show that transforming the first of the equations 


(1) p = 4a cos @ — asec 9, x4 + xy? + ay? — 3ax? = 0 


from polar to rectangular equations gives the second. Remark: The graph of 
these equations is a trisectrix of Maclaurin. It is possible to use a formula for 
tan 3¢ to show that if O is the origin, if Q is the point (2a, 0), if Pis a point on the 
trisectrix in the first quadrant, if 6 is the angle in the interval 0 < 0 < m which 
the line OQ makes with the positive x axis, and if ¢ is the acute angle which OP 
makes with the positive x axis, then 6 = 3¢. Thus the trisectrix provides a 
method (but not a ruler-and-compass method) for trisecting angles. Particu- 
larly when 0 S$ Æ $ < 1/2, the number pin (1) has an elegant geometric interpreta- 
tion. It is [OB| — |OA| where A and B are the points where a line through O 
meets the line having the rectangular equation x = a and the circle of radius 
2a having center Q. 
14 The innocent graph of the equation 


i 
7 T4+2 
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is called the witch of Agnest because Maria Gaetena Agnesi (1718-1799) discovered 
a spooky ruler-and-compass method for constructing points on it. Let C be 
the circle of diameter 1 with center at (0,$). Let T bea point on the line tangent 
to the circle at the point (0, 1), and let Q be the point, different from the origin O, 
at which the line OT intersects the circle. Show that the vertical line through T 
and the horizontal line through Q intersect at a point on the witch. 


10.3 Areas and integrals involving polar coordinates Problems 
involving “areas in polar coordinates” can be formulated in different 
ways. To begin, we suppose that f is a nonnegative integrable function 
of @ over some interval a S$ ¢ < B for which BS a + 2r. Let S be 
the set of points having polar coordinates p, @ for which a S ¢ SB 
and O Sp < f(¢). In case f is continuous, S may be described as the 
set S bounded by the polar graphs of the equations ¢ = a, ¢ = $, and 


VY (df, (op) 


Figure 10.31 


p = f(¢). The schematic Figure 10.31 should not be misleading. To 
find the area |S| of S, we make a partition of the interval a S$ ¢ S 8 
into subintervals of which a representative one of length Ag, contains 
a particular ý. When there is a necessity for being precise about this 
matter, we set Ady = ox — r-ı and choose ¢* such that ¢:-1 S oF S 
r- The area of the subset of S containing points for which ¢x-1 S $ S 
$k can now be approximated by the area 4[f(@7)]? Ad of the sector of 
radius f(¢*) having central angle Ady. When we are hurried, we need 
not make the usual remarks about the way in which the approximation 
depends upon the choice of ¢ž, but in any case we should know why we 
are using the area of the sector instead of its perimeter. An application 
of fundamental ideas about estimating, summing, and taking limits to 
set up integrals then gives 


(10.32) [S| = lim ) 30DE Ae = $ f; UON do, 


where the integral is a Riemann integral. When we are not required 
to write detailed explanations of the reasons for doing what we do, we 
replace (10.32) by the simpler calculation 


(10.33) ISI = lim ) $1f(@)? de = $f" LAI? de 
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in which the subscripts and star superscripts do not appear. In applica- 
tions, we often write p or p(¢) in place of f(¢). 

To add variety to our acquaintance with problems, we define two 
numbers J; and J» by the formulas 


(10.34) = Ff" OOd h= fOr Ol a 


and ask how J; and J». may be interpreted in terms of polar coordinates 
and areas when żı < ta. In order that the integrands and integrals exist, 
it is necessary that p and @ be functions for which p(t), (t), and ¢/(2) 
exist when żı St S te, and we simplify matters by supposing that p, 
¢@, and @¢’ are all continuous over #; S t S tə} As t increases over the 
interval #; St S ts, the point P(t) having polar coordinates (p(t), $(t)) 
traces a curve (or an oriented curve) C from P(é) to P(ts) that could look 
like that shown in Figure 10.35 or like that shown in Figure 10.36. 


Figure 10.35 Figure 10.36 


Of course, there are other possibilities, and we exclude more elaborate 
ones by supposing that the interval t; S ¢ S t can be separated into a 
finite set of subintervals such that, over each subinterval, ¢ is either 
monotone increasing or monotone decreasing. Let P be a partition of 
the interval #; S t S ¢ such that, in each subinterval, (t) is either 
monotone increasing or monotone decreasing. Let tř be chosen in the 
kth subinterval in such a way that 


(10.37) Plik) — Alir) = (E) (k — ter) = Q(t) At, 


and build the Riemann sum 
(10.38) Dalf (te) 26’ (tÉ) Ate 


which approximates J. A particular term in this sum is an approxima- 
tion to the area of the region swept over by the vector from O to P(t) 
as ¢ increases from ¢_-1 to t provided (t) > $(t—1), that is, provided 
the vector rotates in the positive direction. Similarly, the term is an 
approximation to the negative of the area if the vector rotates in the 
negative direction. Thus J; is the sum of areas of regions swept over 
in the positive direction and the negative of areas of regions swept over 
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in the negative direction. We never have negative areas, but we can 
subtract areas because areas are numbers. In case the path of P(t) is 
the closed curve C of Figure 10.36, J is the area of the region bounded by 
C. If the path of P(t) is the curve obtained by reversing the direction 
of the arrows in Figure 10.36, then J, is the negative of the area of the 
region enclosed by the curve. Except in cases where the vector from 
O to P(t) always rotates in the same direction, the number Z% in (10.34) 
is usually less interesting than J,. It is the sum of the areas of all of the 
regions swept over by the rotating vector. 


Problems 10.39 


1 Find the area of the region bounded by the cardioid having the polar 

equation 
p = a(l + cos ¢). 
Ans.: 3ma?/2. 

2 Using the polar equation p = 2a cos @ of a circle of radius a, and noting 
that a particle with polar coordinates (p,p) traverses the circle once as @ increases 
from —r/2 to 7/2, work out the familiar formula for the area of the circular 
disk bounded by the circle. 

3 The graph of the polar equation p? = a? cos 2¢ is the lemniscate shown 
in Figure 10.171. Find the area of the bipartite set which it bounds Ans.: a?. 

4 Find the area of the region bounded by the graph of p = ~/cos ġ. Ans.:1. 

5 Supposing that a > 1 and that n is a positive integer, find the area of the 
region bounded by the polar graph of the equation 


p = a + cos nọ. 
Ans.: ma? + 7/2. 

6 Use integration to find the area 4, of the smaller region which the line with 
rectangular equation y = x slices from the interior of the circle having the polar 
equation p = 2a cos @. Then calculate 4; from the fact that the interior of 
the circle is the union of the interior of an inscribed square and four slices each 
having area 4;. Make the results agree. 

7 Sketch a polar graph of the equations 


p =4+ cost, =F sint 


and make a rough estimate of the area of the region enclosed by the graph. Then 
calculate the area. 

8 Set up an integral for the area 4 of the region bounded by the ellipse 
having the standard polar equation 


ep 


P =I Zz coso 


and show that the result can be put in the form 


T 1 


= h? —_— m aaħŮĖÁĂ 
A=} o (e — cos ¢)* 


ad. 
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9 It is much easier to learn to play a violin than to acquire competence to 
give basic definitions and theorems involving areas of patches of curved surfaces. 
Some problems are so simple, however, that elementary methods yield answers 
that are universally considered to be correct. As Figure 10.391 suggests, a 


Figure 10.391 


hemisphere (or hemispherical surface) of radius a is generated by rotating a 
quadrant of radius a about the y axis. To find the area 4 of this hemisphere, 
we make a partition of the interval 0 S$ ọ S7/2. Rotating the segment of 
length a A¢@ about the y axis gives a part of the hemisphere that can be roughly 
described as a ribbon of width a Ad@ and length 2rr, where r is a cos @ ‘The 
process for setting up and evaluating integrals then gives 


A= lim > 2ra? cos 6 Ad = 2ra? fe cos ọ dọ = 21a’. 


These preliminaries can be ended with the remark that the area of the hemi- 
sphere ought to be about double the area of the equatorial circular disk and that 
the world is so simple that the factor is exactly 2. Now comes the problem. 
Supposing that Oc <c+4ASa, find the area of the zone generated by 
rotating (about the y axis) the part of the arc between the lines having the equa- 
tions y = c and y = c + h. 

10 Supposing that r(#) and @(¢) have continuous derivatives and that 


x(t) = r(t) cos $(), y(t) = r(t) sin $@), 
calculate x’(t) and y’(¢) and show that 
x(t)y’(t) — y(¢)x’(@) = [r@]?¢’@). 


11 Making use of the method involving (10.38), put appropriate hypotheses 
on functions x(t) and y(t) and discover geometric interpretations of the integrals 


[Eror f IOO a. 


12 Show that if a particle P moves on the conic having the polar coordinate 
equation 


a 
(1) p(t) = I — z cos 6) 
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in such a way that @ is a function of t having two derivatives, then the displace- 
ment vector and velocity of P are 


(2) rë) = ERETO [cos $(t)i + sin $(2)j] 


(3) v¢) = —eag'(t) sin (Zt) 


[1 — e cos ¢(#)]? [cos #(Hi + sin SOI] 


4 ee [~ sin o(¢)i + cos (j). 


For the case in which 


(4) zeA = c, 


so that the radius vector from O to P sweeps over equal areas in time intervals 
having equal lengths, show that 


2 
6) #@ = oor = Bll — e cos 0) 
(6) v(t) = — sin $@[cos ei + sin H(i] 

+ a [1 — e cos $(t)][— sin (t)i + cos $(4)j] 
(7) a@)=- xe mar [cos O(2)i + sin P(2)j]. 


Remark: This shows that if a particle P moves along a conic in such a way that 
the radius vector from the focus to P sweeps over equal areas in time intervals 
having equal lengths, then the particle is always accelerated toward the focus 
and the magnitude of the acceleration is inversely proportional to the square 
of the distance from the focus to the particle. Kepler discovered that, except 
for minor perturbations, the planets move in ellipses with the sun at a focus 
and that the “equal areas” property holds. As we did in this problem, Newton 
used these laws to derive his famous inverse square law of gravitation. 

13 Let S be a bounded convex set in the xy plane and let the origin O be an 
inner point of S. (Basic definitions are given in a remark at the end of Problems 
5.19.) Prove that to each @ there corresponds exactly one positive number 
}(@) such that the point P having polar coordinates (f(), @) lies on the boundary 
B of S. Solution: Choose positive numbers 6 and R such that the circular disk 
with center at O and radius 26 is a subset of S and S is a subset of the circular 
disk with center at O and radius R. Let @ be a given angle. Let f(ġ) be the 
least upper bound of numbers po such that S contains each point having polar 
coordinates (p,@) for which 0 < p < po. Then 26S f(¢) SR. Let P be the 
point having polar coordinates (f(@), $) and, as in Figure 10.392, let Qı and Q2 
be the points of tangency of the lines through P tangent to the circle of radius 
ô having its center at the origin. With the possible exception of the point P 
itself, each point of the line segment OP is a point of S. Since Qı and Q» are also 
points of S and since S is convex, it follows that each point inside the triangle 
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Q:PQ2 is a point of S. No interior point Q of the shaded sector of Figure 10.392 
can be a point of S. Otherwise, the point P, being an inner point of the inside 
of the triangle Qi19Q2, would be a point of S and so would also each point of some 


Figure 10.392 


circular disk having its center at P. There would then be a number po greater 
than f(@) for which S contains each point having polar coordinates (p,@) for which 
O < p S po, and this is impossible. We now know that each point inside the 
circle with center at the origin and radius ô is a point of S, that each point inside 
the triangle Q,PQ, is a point of S, and that each point Q interior to the shaded 
sector is not a point of S. It follows from this that f(@) is the one and only 
positive number such that the point P having polar coordinates (f(¢), p) is a 
point on the boundary B of S Remark: Without going into details we observe 
that this proof provides supplementary information that enables us to relate 
f(@ + h) to f(ġ) It can be shown that f is continuous and hence that the 
boundary B becomes a simple closed curve C when its points are so ordered that 
the point having polar coordinates (f(#1), $1) precedes the point having polar 
coordinates (f(@:), @2) when 0 S go: < $2 S 2r. The objection that curves 
were defined in terms of rectangular coordinates is overcome by the observation 
that if f is continuous, then so also are the functions g, and ge defined by 


x = gi(p) = fd) cos > 
y = g:($) = f(P) sin ¢. 


It can be proved that the curve C has finite length L. 


1 1 Partial 


derivatives 


11.1 Elementary partial derivatives Leaving consideration of more 
complicated situations to later sections, we confine attention in this sec- 
tion to examples and problems in which the fundamental ideas can be 
stated quite simply and it is relatively easy to be completely sure of the 
meanings of all of the symbols that are used. We begin with an example. 
Suppose a copper rod occupies the interval xı S x S x2 of an x axis and 
that we are interested in the temperature u (measured in degrees centi- 
grade) at points of the rod at various times t. To be precise about the 
matter, we may suppose that the “space coordinate” x is measured in 
centimeters with x = 0 at some “space origin” and that the “time 
coordinate” ż is measured in seconds with ż = 0 at some “time origin” 
which could be the time at which some particular stop watch was started. 
In some problems, it is not presumed that x and ¢ are positive. For 
present purposes, we suppose that to each pair of numbers x and ż for 
which x; S x S x, and t = 0 there corresponds exactly one number w 
which we may denote by f(x,t) or by u(x,t) and which is the temperature 
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at the “space-time place? having ‘“‘space-time coordinates’ x and z. 
Thus the temperature u is a function of the two “variables” x and z. 
We are now in a realm where ideas of major importance can appear all 
over the place. Some students who are now dubious about the possi- 
bility of doing anything useful or interesting with these functons will, 
in two or three years, have substantial information about the possibility 
of starting with positive numbers a and L and a given function g and then 


determining a positive integer n and constants 4), 42, °° * , An such 
that the particular function u defined by 

` ee 2 
(11.11) u(x,t) = ` Ae P sin x 

k=l 


will be a good approximation to g(x) when ż = 0. In any case, (11.11) 
exhibits important examples (one for each choice of n and 41, Aog °°° 
A,) of functions u to which our work applies. 

We continue study of our example in which temperature u (measured 
in degrees) is a function of x (measured in centimeters) and ¢ (measured 
in seconds). If we wish to study the temperature of the rod at some 
particular time to), we can set ¢ = ¢o and, without bothering to be fussy 
about the distinction between a func- 
tion and values of the function, con- 
sider u(x,to) to be a function of x 
alone. If the graph of u(x,to) versus 
x happens to be that shown in Figure 
Figure 11.12 11.12, we can look at the graph to see 

where the temperature is increasing, 
but it would not be too easy to determine the rate of change of u with 
respect tox. Todo this and get a number of degrees per centimeter, we 
would want to differentiate u(%,to) with respect tox. Thus we are led toa 
very important idea. When v is a function of x and some other variables 
(in our case, just one other variable), it may make sense to assign fixed 
values to all of the variables except x and differentiate the result with 
respect tox. In particular, the idea does make sense when we know what 
these other variables are and, in addition, we know that the resulting func- 
tion of x is a differentiable function of x. When we know what we are 
doing (this is a conservative statement providing for the possibility that 
we may sometimes be puzzled by situations in thermodynamics) the result- 
ing derivative is called the partial derivative of u with respect to x. While 
there are other and more informative symbols for partial derivatives, the 
simplest and most ingenious one is du/dx. The “curly dees” in this 
symbol are unusual Greek deltas, and the symbol is usually read ‘‘par- 
tial of u with respect to x.” 

We are already in a position to understand statements and make cal- 
culations. Partial derivatives are important things that abound in 


? 


u(x, to) 
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books and on blackboards and in notebooks and on scratch pads. Wher- 
ever we find dw/dg, we automatically know that w is a function of q 
and some other variables, we know that fixed values have been assigned 
to all variables except g, and we know that dw/dgq stands for the result 
of differentiating the resulting function of g with respect to g. If w 
is measured in gees and g is measured in haws, then ðw/ðq is measured 
in gees per haw. When we see the formula 


(11.13) u = x? + y? + e7 sinby or u(x,y) = x? + y? + e7 sin by, 


we can compute partial derivatives with respect to x by supposing that 
all variables except x are assigned fixed values, so that they are to be 
regarded as constants when we differentiate with respect to x to obtain 


ð , 
(11.131) — = 2x + ae sin by or u(x, y) = 2x + ae sin by. 


The second one of these formulas involves the standard subscript nota- 
tion for partial derivatives. Similarly, 


ð 
(11.132) D = 2y + be? cos by or — u(x,y) = 2y + be” cos by. 


It should now be apparent that calculating these partial derivatives is 
equivalent to evaluating the limits in 
(11.14) u.(x,y) = lim u(x + Ax, y) — u(x,y). 
Az—+0 Ax 
u(x, y + Ay) — u(x,y). 
Ay 
Taking the partial derivatives with respect to x of the members of 


(11.131) and (11.132) gives 


u,(x,y) = lim 
y> 


2 
(11.141) ee = 2 + ae sin by or Ure(X,y) = 2 + are sin by 
and 

07u z E _ z 
(11.142) TE abe** cos by ': or Uyz( x,y) = abe? cos by. 


Taking partial derivatives with respect to y of the members of (11.131) 
and (11.132) gives 


2 
(11.143) ae = abe** cos by or Uy (x,y) = abe cos by 
and 
2 
(11.144) a L2 — ber sin by or ty (yy) = 2 — be sin by. 


In the above formulas, we have used the formulas 


ð {ðu 07u _ 
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which serve a dual purpose: they provide abbreviations for the expressions 
on the left sides, and they provide meanings for the abbreviations on the 
right sides. It could be supposed that we should insert a comma between 
the subscripts x and y to write Uus, in place of uzy, but it is customary to 
consider the commas to be superfluous. 

The result of setting x = y = 0 in (11.131) is 


(11.146) ôu LO or  u,(0,0) = 0, 
Ox 

and this shows that the “curly dee” notation for partial derivatives is, 

after all, a miserably poor purveyor of information. When we see the 

first formula in (11.146), there is nothing to tell us that u depends upon 

exactly two variables x and y and that 0 is the result of setting x = y = 0 

in du/dx. When we keep the curly dees, it is often necessary to put 


(11.146) in the form 
(11.147) ou 


— = Q 2(0,0 = 0 
3x loo 7”? uz(0,0) 


so the first formula, like the second, can really mean something. As this 
one example may suggest, the curly dees really should be banished from 
the universe because they have the habit of giving incomplete and some- 
times misleading information. There can be no doubt, however, that 
they are so pretty and give information so quickly that they will con- 
tinue to survive and be used. 

Relatively simple fundamental calculations yield formulas in which 
the first two or the second two of the quantities 


(11.15) ae EG alayh ele) 
both appear and will cancel out if we can be sure that they are equal. 
Taking a partial derivative with respect to x is, like putting on our shoes, 
a procedure that is called an operation. Taking a partial derivative with 
respect to y is, like putting on our socks, another operation. The ques- 
tion whether the mixed derivatives in (11.15) are equal is therefore the 
question whether two operations commute, that is, whether the result of 
performing the two operations in tandem (that is, one after the other) 
is independent of the order in which the operations are performed. 
Correct ideas about the problem involving shoes and socks can be 
obtained by experimentation. The remainder of the text of this section 
is devoted to the problem involving partial derivatives. We begin with 
some definitions in which n can be 1 or 2 or 3 or 416 and the variables 


are usually denoted by x, y or x, y, zinstead of x3, %2, * * * , Xa when there 
are only two or three of them. 
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Definition 11.16 A function f of n variables xı, x2, + © © , xn is said 
to have the limit L as x1, x2, © * + , Xn approaches Q1,42, °° ° , a, and we 
write 

lim f(%1,%2, °° * xn) = L 


01,02, ° ° * , Zn 01,02, °°° än 


if to each e > O there corresponds a > O such that 


| f(x1,%8, + * Xan ~ Ll <e 
whenever 


0 < vV (xi — a1)? + (x2 — a)? + + + + 4 (xn — an)? < ô. 


Definition 11.17 A function f of n variables xi, xa + + > , Xn is said 

to be continuous at ai, a2, © ° * , An tf 
lim f(%1,%2, Cn Xn) = f(a», ann ân). 
T1, T2, ° ° * nai, d2, °* * ,an 

We are now prepared to state a fundamental theorem which guarantees 
equality of fyz and fzy whenever these and some other derivatives exist 
and are continuous; the fact that the theorem gives additional informa- 
tion is interesting but less important. 

Theorem 11.18 If u(x,y), us(x,y), uy(x,y), and usy(x,y) all exist and 
are continuous over some circular disk consisting of points (x1,1) for which 
(xı — x)? + (yı — y)? < ô, then uys(x,y) exists and 


(11.181) Uys, y) = Usy(x,y). 


Proof of this theorem is quite tricky because it requires two applica- 
tions of the mean-value theorem 5.52, and the first of these applications 
must be made in a particular special way to produce the required result. 
We shun the curly dees and use the subscript notation so we can know 
what we are doing. To approach the derivative u,:(x,y) about which 
we must learn, we observe that the derivatives in 


(11.182) u(y) = lim Ud + Ay) — ulay) 
Ay—>0 Ay 

(11.183) u(x + Ax, y) = lim pa târg EA) utis n) 
Ay 


exist when |Ax| is sufficiently small. We must prove that the limit in 


(11.184) walay) = lim Wet Ae Y) = ulay) 

Az—0 Ax 
exists and is u,,(x,y). Substituting for the terms in the numerator of 
the right member gives 


(11.185) 
uye(x,y) = lim lim uetan yt Ay) — wat Ae, y) ue y+Ay)tutey), 
Az—0 Ay—0 x Ay 
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Supposing for the moment that y and y + Ay remain fixed like good 
numbers usually do, we define a function ¢ by the formula 


(11.186) p(t) = u(t, y + Ay) — u(t,y). 


The numerator N of the ponderous quotient in (11.185) is then found to 
be the second member of the formula 


(11.187) N = $(% + Ax) — (x) = $'(£) Ax 
= [un(&, y+ Ay) — u.(é,y)] Ax, 


and an application of the mean-value theorem and (11.186) then gives 
the rest of the formula in which ~ is a number between x and x + Ax. 
Substituting the last member of (11.187) for the numerator in (11.185) 
gives 


° ° z A — ks 
(11.188) u(x, y) = lim lim uz($, y + Ay) — us(é,y) 
Azr—>0 Ay+0 Ay 
Since uz(¢,t) is a differentiable function of ż over the interval from y to 
y + Ay, another application of the mean-value theorem gives 


(11.189) Uyz(x,y) = lim lim u,,(&,n), 
Az—0 Ay-0 


where 7 lies between y and y + Ay. Matters are complicated by the 
fact that both ¢ and 7 can depend upon Ay, and the next step is a delicate 
one that demands careful attention in a rigorous course in advanced 
calculus. The fact that the limit in the left member of the formula 


(11.1891) lim Usm) = uzy(&*,y) 
Ay—>0 


exists and the fact that uz, is continuous imply that there is a number ¢* 
for which |x — &*| < |Ax| and the formula holds. Since uzy is contin- 
uous, this and (11.189) give 


(11.1892) Uya(x,y) = lim usy(é*,y) = wey(%,y). 
Az—0 


Thus the limits exist and have the required value. It is not expected 
that the above proof of Theorem 11.18 will be “learned” in this course, 
but we need not be blissfully unaware of the facts that the theorem is 
important and that we could learn very much about partial derivatives 
and limits if we would (as is often done in advanced calculus) invest 
enough time to make a thorough study of the proof. 

Two observations can be made. In the first place, it is easy to insert 
an extra variable z in Theorem 11.18 and its proof to obtain the formula 


(11.1893) Ury(%,V,2) = Uys (x,y,z) 


when the appropriate functions and derivatives are continuous. In the 
second place, continuity of appropriate functions and derivatives allows 
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us to make any change we please in order of differentiation when x is 
differentiated more than once with respect to variables upon which it 
depends. 


Problems 11.19 


1 Let 
f(x,y) = x? + y? 


and observe that f(x,y) = 2x. Then perform every single step required to 
show in the most tedious possible way that 


lim f(x + Ax, y) — f(x,y) — 


Az—=0 Ax 2x. 


Then think about the whole business. 
2 If 
u (x,y,z) = x? + 2y? + 32, 
show that 
Uz(%,V,2) = 2% uy (x,9,2) = 4y ` u(x,y,2) = 6z 
wz(1,1,1) = 2, wy(1,1,1) = 4, u2(1,1,1) = 6. 


3 Supposing that p > 0 and 
p? =x + y, @ = tan”! z, 


find the first partial derivatives of p and ọ with respect to x and y and then use 
the formulas x = pcos ġ, y = p sin @ to put the results in the form 


ð$ — sing ðh _ Cos $, 


ð ð . 
ae = cos $, 3, = sing, an =T 


p° ðy pP 
4 Obtain the simplest possible expression for 

Ou 0% 

dnt t By? 
when 
(a) u = x? — y? (b) u = 3x?y — y? 
(c) u = log (x? + y?) (d) u = e cos y 
(e) u =xsiny (f) u = tant? 


(g) u = log vV (x — a)? + (y — b} 


Ans.: With one exception, each answer is 0. Remark: The equation 
Ou | Ou 
dri t Ay? = 


and those that appear in the following problems are called partial differential 
equations. It is never too early to start learning that the above equation is the 
Laplace equation. A function u is said to be harmonic over a region if it satisfies 
the Laplace equation and is continuous and has continuous partial derivatives 
at each point of the region. Problem 12 gives an example of a function which 


0 
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satisfies the Laplace equation over the entire plane but is, nevertheless, not 
harmonic over regions containing the origin. 
5 Prove that 


(1) styg =O 


when u = y/x, when u = log (y/x), and when u = sin (y/x). Continue opera- 
tions to prove that if f is a differentiable function of one variable, then 


d AIN pN TOROF 
Ox C) =r C) Oy- \x =f x} x 
and (1) still holds. 
6 Prove that the wave equation 


ðu 02% 
t bx? ~ OF 
is satisfied when 
(a) u = (x + at)? (b) u = (x — at)’ 
(c) u = extat (d) u = sin (x — at) 
(e) u = f(x + at) (f) u = g(x — at), 


it being supposed that f and g are twice-differentiable functions. 
7 Show that the function defined by (11.11) satisfies the heat equation 
„ĉu _ ĝu 
aw =z: 
ðx? Ot 
8 Show that each of the following functions satisfies the equation written 
opposite it: 


(a) u = ax + by *9, +93, = % 

(b) u = sin (x sin y) cos y 5 — sin y 5 = 0 
OS Sap owes Ott? 
D u= -y0 dE- a) TERET 


9 To simplify matters, we suppose that each function appearing in our work 
is continuous and has continuous partial derivatives of first and second orders. 
We begin acquaintance with the idea that if F or F (x,y,z) is a scalar function, 
then the vector function VF defined by 


(1) VF = ziti tE 

is called the gradient of F. If V is a vector function defined by 
(2) V(x,y,2) = P(x,y,2)i + O(x,y,2)) + R(x,y,z)k, 
then the scalar function V-V defined by 


ðP 3Q aR 


(3) VV =a, tay toz 
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is called the divergence of V. Finally, the vector function V X V defined by 


i j k 
(4) vxv=|2 2 2 
Ox dy dz 
P Q R 
is called the curl of V. The a form of (4) is 
OR\. dQ 
O vxve(S-S)i+ (5 - Sit (3 -F)e 
The inverted delta is called “del,” and we shall hear more about the formula 
ð. ð, ð 
(6) Ve gitgitak 
Meanwhile, use the definitions to show that 
OF OF 
(7) VVF) = 55 +54 + 52 
ij =k 
ð ð ð 
(8) V x (VF) =|0x dy ðz|=0. 
aF aF ak 
Ox Oy dz 


10 Start with the formulas 
ô. ô., ð 


F(x,y,3) = x? + y? + 2? 
V(x,y,2) = xi + yj + zk 


and calculate all of the following things that are meaningful: 


(a) VF (b) V-F (c) VXF 
(d) VV (e) V-V (PV XV 
(e) V-(VF) (h) V x (VF) (7) V(VF) 

(7) V(VV) (k) VX (V XV) (7) V-(V-V) 


11 Letting u be the thoroughly reasonable function having the value $p sin 


or p sin g cos @ at the point having polar coordinates p and œ, show that, 


rectangular coordinates, 


(1) u(0,0)=0, uy) = Gay (e t y 
Show that , , 
Ou y ðu x : 2 
(2) ax (x? + y2)? ðy (x? + y?)%8 (2 + y? ¥ 
Show that 
(3) lim uz(0,y) = 1, lim u,(0,y) = —1. 
y>0+ y0- 


Show that uz(x,0) = 0 for each x and that u,(0,y) = 0 for each y. 
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12 The function u for which u(0,0) = 0 and 


a) ues) = Gro (2 + 9? = 0) 


can be considered more than once. Show that 


x2 — v2 y2? — x? 
(2) Uzz(%,y) = 24xy CETO Uyy(x,y) = 24xy (x? + y2) 
and hence 

ĝu Ox 
(3) Ox2 + ay? = Ur2(x,y) + Uyy (x,y) = 0 


when x? + y? = 0. Show that (3) holds when x = y = 0. Show that 
(4) lim u(x,x) = œ. 
0 


g> 


13 Formulas more or less like 


(1) Fe) = f, Fes) dy 


often appear in pure and applied mathematics. We suppose that, for each x 
in some interval, the integral in the right member of (1) exists and is a number 
F(x). More advanced courses set forth conditions under which F’(x) exists and 


can be obtained by “differentiating with respect to x under the integral sign” 
so that 


br ð 
(2) Pe) = f 5 felen) | ay. 
When this procedure is correct, we can combine (1) and (2) to obtain the formula 
d fè br a 
3 — = — 
3) z |, fees) dy = f | = Fe») | ay. 


Verify that (3) is correct when 


(a) a=0,b =1, f(xy) =x+y 
(b) a =0,b = 1, f(x,y) = x? + y? 


(C) a =1, b = 2, fæ) = E 
x 


11.2 Increments, chain rule, and gradients Throughout this sec- 
tion, we suppose that u is a function of three variables x, y, z and we restrict 
attention to a region R in £3 over which u is continuous and has con- 
tinuous partial derivatives uz, uy, and u, In many examples the region 
R is the whole £3, but this is not necessarily so. Whenever a useful 
purpose is served, we can regard u(x,y,z) as being the temperature or 
pressure or potential or density or humidity at the point P(x,y,z). While 
u(x,y,z) cannot be a vector, it can be the scalar component in some par- 
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ticular direction of a vector. In any case, it should be recognized that 
our function can be of interest to men as well as to boys. Everything 
we do will depend upon the fundamental fact that there is an astonish- 
ingly effective way of estimating the number Au, defined by 


(11.21) Au = u(x + Ax, y + Ay, z + Az) — u(x,y,2), 


which represents the difference (or increment) between the values of u 
at two places. The basic trick is to divide and conquer the huge dis- 
crepancy between the natures of the two terms in the right member of 
(11.21) by subtracting and adding terms to obtain the telescopic sum in 


(11.22) Au = u(x + Ax, y + Ay, z + Az) — u(x, y + Av, z + Az) 
+ u(x, y + Ay, z + Az) — u(x, y, z + Az) 
+ u(x, y, 7 + Az) ~~ u(x,¥,2Z). 


Defining a function ¢(t) by the formula 
(11.221) $) = u(t, y + Ay, z + Az) 


enables us to apply the mean-value theorem to the difference in the 
first line, and similar functions apply to the other two. Thus we obtain 
Au = u,(x*, y + Ay, z + Az) Ax + u(x, y*, z + Az) Ay + u,(x,y,2*) Az, 
where x* lies between x and x + Ax, y* lies between y and y + Ay, and 
z* lies between z and z + Az. Our hypothesis that the derivatives are 
continuous allows us to put this in the form 


(11.222) 
Au = [us(x,y,2) + e] Ax + [uy(%,y,2) + e] Ay + [ue(x,y,2) + es] Az 


where e1, €2, e3 are quantities which approach zero as Ax, Ay, Az approach 
zero. This formula can be put in the form 


(11.223) Au = E + e| Ax + É + 3 Ay + E + 3 Az. 


It is quite possible to rub out the epsilons, replace deltas by dees, write 
the formula 


Ou ðu ðu 
(11.224) du = ax dx + By dy + an dz, 


and then undertake to explain the antics. At least for the present we 
adopt different tactics. 

Let three functions x, y, z be defined and differentiable over some interval 
T of values of t, and let the point P(t) having coordinates x(t), y(t), z(t) 
trace a curve Cin R as t increases over T. Let 


(11.225) w(t) = u(x(z), y(t), 2@)). 
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Looking forward to derivation of a formula (the chain formula) for w’(t) 
we write 

w(t + At) = u(x(t + At), y(t + At), z(t + At)). 
Thus 

w(t + At) = u(x(t) + Ax, y(t) + Ay, 2) + Ax), 


where 


Ax = x(t + At) — x(t), Ay = y(t + At) — y(t), Az = z(t + At) — z(t). 
Applying (11.222) then gives 


w(t + At) — w(t) = [u2(x(2), yt), 2(2)) + ellet + At) — x(2)] 
+ [uy(x(t), y), 2(4)) + edly + At) — y(i) 
+ [ue(x(t), y), 2(2)) + eal[z(¢ + At) — 2(2)], 


where e, e2, €3 are quantities which approach zero as At approaches zero. 
Dividing by At and taking limits as At approaches zero gives the chain 
formula (11.232). The following theorem sets forth conditions under 
which the chain formula is correct. 

Theorem 11.23 (chain rule) If u is continuous and has continuous 
partial derivatives Uz, Uy, Uz over a region Rin Es, and if 


(11.231) w(t) = u(x), y@), 2(4)), 


where x, y, z are differentiable functions of t over some interval T, and if the 
point P(t) having coordinates x(t), y(t), z(t) traverses a curve Cin Rast 
increases over T, then w 1s differentiable over T and 


(11.232) w'(t) = uxt), yC), 2())x"@) 
+ ull), y), 2))9’@) 
+ u(x(t), yE), 2(@))z'@) 


when t 1s in T. 


In case [x (4) + [y’(z)]2 + [z (t)]}2 = 1 so that P(t) moves along C with 
unit speed, the number w’(z) is called the directional derivative of u in the 
direction of the forward tangent to C at P. For this and other reasons, 
some of which will appear later, the chain formula (11.232) is extremely 
important. We temporarily suspend production of mathematics to 
ponder consequences of the rude fact that it takes a long time to write 
the formulas (11.231) and (11.232). We can wonder how much we can 
abbreviate these formulas without abbreviating the life out of them and 
without creating confusion that wastes more of our time than the abbre- 
viations save. We can write (11.231) in the form 


(11.233) u = u(x,y,2) 


and carry in our minds the idea that the left side stands for the value of 
a function of ¢ and so also does the right side but, on the right side, x is 
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linked to ¢ by the intermediate variables x, y, z which are functions of t. 
We can then abbreviate (11.232) to 


du dudx , dudy , ðudz 
11.234 —e yey oe. 
(11.234) di osdi dy d on di 
It turns out that, in practice, the version (11.234) of the chain formula 
is often very convenient. For example, if 


u = x? + y? + 2? x = sin t, y = é, z= 17, 
then 


d 
T = 2x cos t + 2ye + 4zt 


and it is, in fact, not always necessary or even desirable to express the 
right side entirely in terms of ż The abbreviations are not always so 
agreeable, however. In case the parameter tis x itself, we must employ 
considerable fortitude to comprehend the sentence containing (11.233) 
and the formula 

du ðu , Oud ðu dz 
11.235 ee y uay | OU az 
( ) dx dx  ðydx ~ dz dx 
It is awkward and perhaps even undesirable to be required to think of u 
depending upon x in two different ways. It is easier to let 


w(x) = u(x, y(x), 2(x)) 
and to write 
dw  Oudx , dudy , ðu dz 
ds dx dx dy dx! On dx dx 


and then observe that dx/dx = 1. While Theorem 11.23 is the fully 
meaningful theorem to which we can refer when meanings of symbols 
must be carefully set forth, we give a restatement of the theorem in 
terms of the simpler curly dee notation. 

Theorem 11.24 (chain rule, second version) If u is continuous and 
has continuous partial derivatives 0u/dx, du/dy, du/dz over a region R in 


Ez, and if 
(11.241) u = u(x,y,2), 


where x, y, z are differentiable functions of t over some interval T, and if the 
point P having coordinate (x,y,z) traverses a curve C in R as t increases 
over T, then u is differentiable over T and 


du _ðudx , dudy , ðu dz 


(11.242) hk ek Ha RE 


It is not often that we have an opportunity to make an observation 
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as valuable as the one that the right member of (11.242) is the scalar 
product of two vectors. Thus 


du ðu., , Ou. , ðu dx. , dy, , dz 
(11.25) a (Mi Bi + uy Si 4 Bi + Fe) 


The remarkable feature of this product is the fact that, for each ż, the 
first vector depends only upon the first partial derivatives of u at the 
point P(x,y,z) and the second vector is (when it is not 0) simply a forward 
tangent to C at P. 

Information is ready to gush from (11.25), and we make progress by 
learning about the first vector which is called the gradient of u at P and 
is denoted by Vu so that 


Ou. , ðu, , ðu 


The symbol V, an inverted delta, is read “del”? and Vu is read “del u.” 
We must always remember that Vu is a vector. For many purposes, it 
is very helpful to consider V itself to be an operator, 


ð, ð., ð 


which can be applied to a scalar function u having continuous partial 
derivatives to produce the gradient vector Vu. In case Vu = Q, this is 
the whole story and there is nothing more to be learned. Henceforth 
we suppose that Vu #0. Let the last vector in (11.25) be a unit vector v 
so that du/dt is the directional derivative of u at P in the direction of V. 


Then 


(11.262) at = (Vu)-v = |Vul cos 6, 
where |Vu| is the length of the gradient Vu and @ is the angle between the 
vectors Vu and v. Since —1 < cos 6 = 1 and cos 0 = 1, it follows that 
the direction of Vu is the direction in which the directional derivative of u 
at P attains its maximum value and that the length of Vu is this maximum. 
This is the fundamental intrinsic property of the gradient of u at P. 
Since cos x = —1, the direction opposite to that of Vu is the direction 
in which the directional derivative of u at P attains its minimum value, 
and —|Vu|is the minimum. Some applications are easy to understand. 
If u is temperature and Mr. Walker is at a place that is too cold to suit 
him, he will walk in the direction of the gradient of u, and the length of 
the gradient will tell him the rate (in degrees per meter, for example) 
at which his position becomes more comfortable.. 

Since cos x/2 = 0, directional derivatives in directions orthogonal to 
the gradient Vu will be 0, and this can make us think about level surfaces 
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(isothermal surfaces or equipotential surfaces, for example) upon which 
u has a particular constant value. If P o(%o,4'0,20) and some sphere with 
center at Po lie in our region R, then our hypotheses (including the 
hypothesis that Vu + 0 at Po) imply that there is a surface S consisting 
of points P(x,y,z) for which u(x,y,z) = u(xoyo,z0). Let C be a curve 
which lies on S and passes through Pp and has the vector v for its forward 
tangent at Po. Supposing that a point moving along this curve has 
coordinates x(t), y(t), z(t), we have 


(11.27) * ualt), y(t), 2@)) = u(xo, Yo, zo) 


and hence du/dt = 0. Therefore, (Vu)-v = 0. Thus Vu is orthogonal 

to each line which passes through Po and which is tangent to a curve on 

S. As Figure 11.271 may suggest, this is just 

what we mean when we say that Vuis a normal 

to the surface S at the point Po. Therefore, A D 

we can find the normal to the surface S having S A 

the equation u(x,y,z) = c, at a point Po on o ymn 

S, by finding the gradient of u at Po. To find 

the plane tangent to S at Po is easy; it is the _ 

plane through Po orthogonal to the gradient. Figure 11.271 
A review of matters relating to equations of 

lines and tangent planes may be in order. If the gradient of u ata point 

Po(x0,Yo,%0) is 


R R Vu 


AE 


Ai + Bj + Ck, 


then the equations of the line through Po upon which this gradient lies are 


x — Xo _Y — Yo 2 — 20 
(11.28) 4 =p =T 
and the equation of the plane through Po normal to the gradient is 
(11.281) A(x — xo) + Bly — yo) + C(z — z) = 0. 


The equations (11.28) are correct because they say that the scalar com- 
ponents of the vector from Po(x0,7o0,%0) to P(x,y,z) are proportional to the 
scalar components of the gradient. The equation (11.281) is correct 
because it says that the vector from Po(xo,yo,Z0) to P(x,y,z) is orthogonal 
to the gradient. 


Problems 11.29 


1 This problem requires us to think about some ways in which the formulas 
(11.21) to (11.25) can be used to solve problems. Suppose f is a given function 
having continuous partial derivatives. Suppose we want f(x,y,z) but we cal- 
culate f(x0,yo,20) because it is easier to calculate f(x0,yo,%0) or because x, y, z are 
unknown and xo, yo, Zo are the numbers we got when we measured them. How 
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can we estimate the error resulting from using f(%o,yo,%0) instead of f(x,y,z)? 
Ans.: Put 


x=%, VEY, Z=% Âx=x— xo Ay=y—yo, Az=z%— z 


in formulas from (11.21) to (11.223). Remark: Very often we do not want to 
hunt up books and copy formulas from them. See the next problem. 

2 Remember the following modus operandi because it is useful when properly 
used. As an approximation to the number Ax defined by 


(1) Au = u(x + dx, y + dy, z + dz) — u(x,y,2) 


use the number du defined by 
Ou Ou Ou 
(2) du = 9, 0% + 3, dy + ay 


Remember that (2) can, in appropriate circumstances, be obtained by differ- 
entiating with respect to ¢ by the chain rule (Theorem 11.24) and multiplying 
by dt. Note the similarity between this modus operandi and the one involving 


(3.96) 
3 Supposing that y = p sin @¢, derive the formula 


|dy| £ |p cos | |dø| + |sin | |dp| 


which gives information about the error in y resulting from use of erroneous values 
of p and ¢. 

4 Supposing that y = p cos ġ, where p and ¢ are functions of ż, find a 
formula for dy/dt in two different ways. First use partial derivatives and the 
chain rule. Then use ordinary (not partial) derivatives and the rule for differ- 
entiating products of functions of t. Make the results agree. 

5 Formulate and solve another problem more or less like the preceding one. 

6 Supposing that 4, B, C are constants for which 4? + B? + C? = 1, find 
the directional derivative of the function (or potential function) 


(1) Vy = ao 
(x — x1)? + O — y) + (@ — 21)? 
at the point (x0,yo,20) in the direction of the vector 
(2) D = 4i+ Bj + Ck 
by two different (or superficially different) methods. In the first place, put 
(3) x = xo + Át, y = yo + Bt, z = z + Ct 


in (1) and find dV/di by differentiating with respect to ¢ without use of partial 
derivatives. Then putt =0. In the second place, show that 
yy = — a ei + Y — yj + (z — 21)k 
[(@ — x1)? + (y — y)? + (@ — z) 
and calculate the scalar product (VV)-D. Then put x = xo y = yo, Z = Zo 
Make the results agree. Remark: It is worthwhile to observe that, by introducing 
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summation signs, we can easily extend all of these calculations to cover more 
complex situations in which 1 is a positive integer and 


y= $ t 
kar V(x — x)? + (y — ye)? + (2 — z)? 


There are times when it is not unreasonable to start with a special situation in 

which n 2 3 and make quite extensive calculations to learn about directional 

derivatives, gradients, and equipotential surfaces. An equipotential surface is a 

surface consisting of points (x,y,z) such that, for some constant ¢, V(x,y,2) = c. 
7 Let the temperature u at the point (x,y) in an xy plane be defined by 


u = e-* sin y. 


Modify the procedure of Problem 6 to obtain, by two methods, the directional 
derivative of u at (xo,yo) in the direction of the unit vector 
(cos a)i + (sin a)j. 
Make the results agree. 
8 Supposing that 
= Ax + By + Cz, 

show that 

Vu = Ai + Bj + Ck. 


Use this result to show that the equation of the plane tangent to the graph of 
the equation 


Ax + By + Cz = D 
at a point Po(xo,1'0,20) on the graph is 


A(x — xo) + B(y — yo) + C(z — z) = 9 
or 
Ax + By+Cz = D 
9 Supposing that 
(1) =5+54+5 


show that 
(2) Ve = Zi +j + 2k. 


a? b: z? 
Use this result to show that the equation of the plane tangent to the graph of the 
equation 


we ye wt 
(3) a + b? + z? I 
at the point (xo,4'o,20) on the graph is 
(4) 280 (a — ms) + 2 O - 
or 


Zo + 2 Dr + =o = ], 


Remark: In case a = b = c, the "an of 3) is asphere. Otherwise, the graph 
is an ellipsoid; see Figure 6.631 and the accompanying discussion. 
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10 When 
u = x? + y? — 2? 


the graph of the equation u = 0 is a cone having its vertex at the origin. If 
Po(%o,0,%0) is a point on the cone which is not the origin, show that the equation 
of the plane tangent to the cone at Po is 


xox + yoy — zoz = Q. 


Note that the gradient at the origin is 0 and that we have no information about 
planes (if any) that are tangent to a cone at its vertex. 
11 The graph of the equation 


2 2 
(1) z=% -ŽŽ 


is a hyperbolic paraboloid or saddle surface more or less like that shown in Figure 
6.672. The sections in planes parallel to the xz and yz planes are parabolas 
while other sections are hyperbolas. Letting 


x? 2 
(2) watt 5-% 


find the gradient of u at the point Po(xo,yo,%0) on the graph of (1) and show that 
the equation of the plane tangent to the graph at Pp is 


(3) ot _ Yo) EEE mo, 


a b 
12 Letting u be the left member of the equation 
(1) anxx + awy + az + bile + x) + anys + azyy + azyz + bely + y) 
+ agizx + azzy + azz + balz + 2) =c, 
where a21 = 412, 431 = 413, and azz = 423, show that 
(2) Vu = 2[lanx + ary + asz + bili + 2lazx + azy + azz + baj 
+ 2[azx + 4327 + a33z + balk. 


Supposing that the graph of (1) is a quadric surface S containing a point Po(xo,/o,20) 

at which Vu = 0, write the equation of the plane tangent to S at Po. Put the 

equation in the form obtained from (1) by the following ritual. Wherever the prod- 

uct or sum of two variables appears, award a subscript to the first factor or term. 
13 Prove that each plane tangent to the surface having the equation 


x7 + yh + 23% = a% 


intersects the coordinate axes in points that are projections upon these axes of a 
point on the sphere having the equation 


x? + y? + 2? = a?, 
14 Find the gradient of the function F for which 
F(x,y,z) = xyz — 1, 
find equations of the line normal to the surface having the equation 


xyz = 1 
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at the point (%o,¥o,%0), and find the equation of the plane tangent to the latter 
surface at the latter point. Ans.: 


VF = yzi + azj + ayk 
Yozo T X ozo 7 xoy 
Yozo(% — xo) + xozo(y — yo) + xoyo(z — zo) = 0. 


15 Prove that if the plane z is tangent to the graph of the equation xyz = a3, 
then m and the three coordinate planes are the boundaries of a tetrahedron 
having volume 9a3/2. 

16 Prove that if u, wz, uy, and u, are continuous over a spherical ball having 


its center at Po(x0,v0,%0), then u cannot have even a local minimum or a lucal 
maximum at Pp unless 


Uz (Xo,V0,20) = U y(%0,1' 0520) = 1t2(%0,%'0.%0) = 0. 


Hint: Tell what can be done when the gradient at Po is not O. If necessary, look 
at (11.262). 


17 Using the result of Problem 16, show that if 


u= ey 
ltt py +e 


attains a maximum at P(xo,¥o,z0), then xo = yo = zo. With this assistance, find 
the place where x is maximum and show that the maximum value of u is +/3/2. 

18 Remark: This is a remark for those who wish to see that gradients can be 
used to introduce a fundamental idea that is often used to solve more difficult 
problems. Suppose we are required to find numbers 
x, y, 2 for which f(x,y,z) is a minimum or maximum when 
x, Y, z are required to satisfy a supplementary condition 
of the form u(x,y,z) = c. To catch the idea, we suppose 
that 

fy) = x? + y? + 2? 


and that the graph of u = cis a surface more or less like 
that shown in Figure 11.291. It is easy to guess that if 
a maximum or a minimum occurs at P(x,y,z), then Figure 11.291 
the gradients of f and u at P must have the same or 

Opposite directions and hence that there must be a constant A such that 


Vf +AVu =0 


or 
(1) Vif + Au) = 0- 


We now make a profound observation. Finding values of x, y, z, and À for which 
(1) holds is equivalent to finding values of x, y, z, and A for which the first partial 
derivatives of the function w define by 


(2) w (x,y,z) = F(%,9,2) + Alu (x,y,z) = c] 


are 0. All this leads us to an idea that can sound very strange and which might 
be quite useless if it were not for the fact that it is the very useful idea that the 
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Lagrange multiplier A should be used in the following way. When we want to 
find numbers x, y, z for which f(x,y,z) takes extreme values when x, y, z satisfy the 
supplementary condition u(x,y,z) = c, we seek numbers x, y, z, A for which the 
function w defined by (2) takes extreme values. We illustrate use of Lagrange 
multipliers by using them to obtain information about a thoroughly important 
problem. Letting 
u = axx + axy + 413xz 
+ anyx + azyy + 423y2 
+ azızx + azzy + 43322, 


where do; = 419, 431 = 413, and azı = 423, we seek points on the quadric surface 
having the equation u = c which lie at least and (if they exist) greatest distances 
from the origin. In this and similar problems, we systematically use the idea 
that distances have extreme values when their squares do. To make answers 
come out in the forms that are familiar when this problem is attacked by other 
methods, we modify (2) by writing —A~! in place of \ and define w by the formula 


w = x? + y? + 22 — A“ ![u(x,y,z) — el. 


Equating the first-order partial derivatives to 0 gives the equations 


oe = 2x — 2N! (aux + airy + aiz) = 0 
Ow 
ay = 2y — 2N! (aax + azzy + asz) = 0 
ðw 


Jz = 2z — 271 (azıx + a327 + 332) = 0 


(a — A)x + a yt a3 2=0 
421 x + (doe — A)y + 223 z= 0 
aa x+ az y + (az — A)z = 0 


which are necessary for extrema. When c + 0, these equations and the equation 
u = ¢ cannot be simultaneously satisfied by (x,y,z) unless 


4i — À ai2 413 
a21 a22 — A 223 = 0. 
a3) 232 433 — A 


In many important cases, values of A satisfying this equation can be found 
(usually only approximately) and the problem can be finished. 

19 To become acquainted with Lagrange multipliers by solving easy prob- 
lems, find extrema of the first function when the second equation is required to 
be satisfied: 


(a) x? + y+ 2? ax +by+ezt+d=0 
(b) x? + y? y= x? — 4 
(c)x+ytz ATE AEE 


d pTa 
Solution of part (b): Let 
w =x 4 y + Aly — xL + 4). 
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The extrema (if any) occur when x, yY, A are such that 


a 
a = 2x — 2xd = 0 
3 
ao = 2yth=0 
and 
y = x? — 4, 


These relations imply that à = 1, y = —4, x = + \/3.5, and x? + y? = 3.75. 
20 Formulas more or less like 


(1) w= |" fey) dy 


often appear in pure and applied mathematics. It is supposed that u and 2 are 
functions of x and that, for each x in some interval, the integral in the right mem- 
ber exists and is anumber w. More advanced courses set forth conditions under 
which dw/dx can be obtained from the chain formula 


(2) dw _ _ ðw do , Owdu 


ax Oo EE aE 


When appropriate conditions are satisfied, applications of the fundamental 
theorem of the calculus give 


(3) Z [" HE) dy = feo) 
Saf fea) dy = = È S" Hay) dy = Flo). 


When (see the last of Problems 11.19) we can differentiate under the integral 
sign, we get 


Ow v ð 
(4) je = fife) oy. 
Substituting in (2) then gives the formula 
af ~ tae)? — flea tt 4 [SED g 
(5) ES Hew) dy = feo) F — flew) Fe + [LE ay, 


Verify that (5) is correct when 


(a) u = x, 0 = 2x, f(x,y) = x+y 

(b) u =X, J = 2x, f(x,y) = x? + y? 

(c) u = x, v = a + x, f(x,y) = 1/y 

(d) u = x?, v = x3, f(xy) = (x + yer 
(e) u = x, v = x, f(x,y) = log y 


21 It can be observed that our proof of the chain rule for functions of more 
than one variable is more straightforward than our proof of the chain rule (Theo- 
rem 3.65) for functions’ of one variable. Can this phenomenon be explained? 
Ans.: Yes. When we proved Theorem 3.65, we did not know about the mean- 
value theorem and, moreover, the mean-value theorem was inapplicable because 
we did not have the hypothesis that the derivatives exist over intervals and are 
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continuous. When we proved Theorem 11.23, we knew about the mean-value 
theorem and had enough hypotheses to enable us to apply it. 

22 This problem provides preliminary information about a way in which 
surfaces can be determined and studied. The circle in the xy plane having its 
center at the origin and radius a has the simple rectangular equation x? + y? = 
a?. We have seen, however, that it is often convenient to use the parametric 
equations 


(1) x = a COS ii, y =asinu 


and to recognize that, when r is the vector running from the origin to P(x,y), 
we have 


(2) r = (a cos u)i + (a sin u)j 


and the tip of r runs once in the positive direction around the circle as u increases 
over the interval 0 < u < 2r. This remark, in which « has been used where we 
ordinarily use 6 or ¢, is designed to lead us gently to the idea that if fi, fo, fs are 
suitable functions of two parameters u and 9, then the vector r defined by 


(3) r = fi(u,v)i + fo(u,2)j + fa(u,e)k 

will be the vector running from the origin to the point P(x,y,z) on a surface S 
for which x = f,(u,v), y = fo(u,o), 2 = fs(u,o). For example, when we use 
spherical coordinates r, ġ, 6 as in Section 10.1, the equation of the sphere S 
having its center at the origin and radius a has the spherical equation r = a. 
The formulas of Problem 3 of Section 10.1 then show that the point P on S 
having spherical coordinates r, 6, @ has rectangular coordinates x, y, z, where 


(4) x = a cos @ sin 6, y = a sin ¢ sin 0, z = a cos l. 


Except that the parameters are called @ and @ instead of u and v, we obtain a 
special case of (3) by setting 


(5) r = a(cos ġ sin ĝi + sin @ sin ĝj + cos 6k), 


and (5) is a two-parameter parametric equation of the sphere S a part of which 
is shown in Figure 11.292. When 0 < 6) <7 
and @ = ĝa (5) is the parametric equation 
of a circle Cp(ĝo), a geographic parallel, on 
S which the tip of r traces as @ increases from 
—r to m. When —r < ġo Sr and ¢ = ġo, 
(5) is the parametric equation of the semi- 
circle Cu(o), a geographic meridian, which 
the tip of r traces as 0 increases from 0 to 
m. We make only a few calculations to illus- 
trate the utility of these things. When 
0 < o <r, the foward tangent tı to Cp(6) 
Figure 11.292 at the point Po for which @ = ġo and 0 = bo 

is obtained by putting 6 = ĝo in (5), differ- 
entiating with respect to œ, and putting = @o in the result. Thus, 


(6) tı = a(— sin ġo sin Ooi + cos ġo sin ĝoj). 
Similarly, when —r < ġo <S r, the forward tangent tz to Cu(o) at the point 


— 
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Py is obtained by putting @ = ġo in (5), differentiating with respect to 6, and 
putting @ = ĝo in the result. Thus, 


(7) te = a(cos do cos Ooi + sin do cos Aj — sin Ook). 


While the north and south poles of a sphere are as good as any other points on 
the sphere, it often happens that methods involving spherical coordinates refuse 
to give information about them. When 0 < 69 < r, the two vectors tı and tz 
are nonzero nonparallel tangents to S at the point Po, and with the aid of the 
fact that S has a tangent plane and a normal at Po we conclude that the vector 
product of tz and tı must be a normal to S at Po. Letting N = te X ti, we find 
that N = a? sin Or, where ro is the vector running from the origin to Pe This 
shows that fois normal to Sat P. In working out this elementary fact we have 
shown how, in at least one case, (3) can be used to obtain information about the 
surface which it represents. 


23 As in (5) of the preceding problem, let r have its tail at the origin and let 
r = a(cos @ sin ĝi + sin ¢ sin Oj + cos Ok). 


Let ġ and 0 be differential functions of ż so the tip of r traverses a curve C on 
the sphere S aszincreases. Findr'(t). Ans.: 


r’(t) = a8'(t)[cos @ cos ĝi + sin ¢ cos 8j — sin Ok] 
+ adg'(t) sin 6[— sin di + cos gj]. 
24 Letr have its tail at the origin and let 
(1) r = (b + a cos @) cos di + (b + a cos 6) sin dj + a sin Ok. 


Let @ and 0 be differentiable functions of ¢ so that, as we can see with the aid of 
Problem 22 at the end of Section 2.2, the tip of r traverses a curve C on a torus 
T as t increases. Show that 


(2) r’(t) = a6’(t)[— sin ĝ cos di — sin @ sin @j + cos 6k] 
+ (b + a cos 6)¢'(t)[— sin di + cos gj]. 


Show that the two vectors in brackets are orthogonal. Work out the formula 
n = cos ¢ cos ĝi + sin ¢ cos ĝj + sin 6k 


for the unit normal to the torus. 
25 This problem illustrates the fact that there are situations in which the 
elegant and useful chain formula 


d _ dx ~ dy 
(1) 7 u(x,y) = us (x,y) a + uy (x,y) 7 


cannot be applied with impunity. Let 


— _ _*y* 
(2) u (0,0) < 0, u(x,y) _ x? + y? 
Let a and b be nonzero constants and let x = at and y = bt. Show that, when 
t = 0, the left member of (1) is ab?/(a? + b?) and hence is not 0 while the right 
member is 0. Remark: Because (1) is invalid, Theorem 11.24 implies that uz 
and u, cannot be continuous at the origin. 


(x? + y? = 0). 
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11.3 Formulas involving partial derivatives This section and its 
problems require us to learn more about partial derivatives and some 
formulas that have important applications. The first part of the section 
is a rather dismal discussion of unlovely terminology designed to promote 
understanding of curly dee abbreviations. To begin, let f be a function 
of two “‘secondary variables” x and y, and let gı and ge be functions of a 
single “primary variable” a. Then, in appropriate circumstances, we 
can set x = gi(a), y = ge(a), and define a function F of the single pri- 
mary variable a by the formula 


(11.31) F(a) = f(x,y) = f(gile), ge(e)). 


In this and all similar situations in this section, we suppose that the 
arguments of functions of one “‘variable’’ are confined to intervals over 
which the functions are differentiable and that the arguments of functions 
of more than one “‘variable” are confined to regions over which the func- 
tions have continuous partial derivatives of first order. Differentiating 
(11.31) with the aid of the chain rule then gives a result that can be 
written in ways that look very different. Using notation of one brand gives 


F'(a) = felga), go(oe))gi(a) + fulgi(a), g2(@))g2(a). 


This can be put in the form 


(11.321) F'(a) = falx,y)gi(a) + fy(™,y)go(a) 


and we are responsible for remembering that the secondary variables 
are linked to the primary variable by the formulas x = gi(a@), y = g(a). 
Next, we can put this in the form 


Hq) = Of ae 4 of dy. 
(11.322) F(a) = Ox da + dy da 
Finally, as we usually do when the numbers in (11.31) represent tempera- 
ture or something having recognizable significance, we denote the mem- 
bers of (11.31) by a single appropriately chosen letter, say u, and write 


du du dx , Ou dy 
(11.33) da ~ 3x da | dy da 
The abbreviated formula (11.33) is now expected to tell us that u is 
linked to a primary variable a by the secondary varables x and y that 
are identified by the fact that du/dx and du/dy appear in the formula. 
This formula makes sense and enables us to make calculations when, for 
example, we have the formulas u = x? + y, x = cosa, y = sin a. 
One application of this material is worthy of mention. Suppose that 
a function F(x,y) of two “‘secondary variables” and a constant ¢ are given 
and that y(x) is a function of the “‘primary variable” x such that 


(11.34) Fey) = ¢ 
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when y = y(x). In this case, x is both a “primary variable’ and a 
“secondary variable.” Differentiating with respect to the “primary 
variable” x then gives the formula 


(11.341) 


from which dy/dx can be calculated when 0F/dy = 0. This is a fact 
involving “implicit functions,” the idea being that (11.34) is not a formula 
that gives an explicit formula for y in terms of x, but (11.34), and per- 
haps some further restrictions, may nevertheless imply that y must be 
the unique member or one of the members of a class of differentiable 
functions. The real significance of (11.341) lies in the fact that it often 
enables us to obtain a useful formula for dy/dx without undertaking 
the sometimes difficult or impossible task of “solving” (11.34) to obtain 
a useful formula for y. Problem 7 at the end of this section gives sub- 
stantial information about this matter. 

A satisfactory development of our subject 
must call attention to the fact that the symbol 
du/dx in (11.33) loses its unambiguous meaning 
when it is taken out of its context and we are 
not sure that the “independent variables” are x 
and y. To prove this, we construct Figure 
11.35, in which P is supposed to be a point in 
the first quadrant. Witheach such pointP, we 
may associate the circle through P with center Figure 11.35 
at the origin and let 4 be the area of the disk 
bounded by this circle. If we consider x and y to be the independent 
variables that determine P, we obtain the first and then the second of 
the formulas 


(11.351) A = w(x? + y, 24 = 2rx. 

If we consider x and p to be the independent variables, then 
(11.352) A = 10, 24 - 

If we consider x and ¢ to be the independent variables, then 
(11.353) Aan og 4 _ oe. 


Since the symbol 04/dx has different values in different contexts, the 
symbol must be discarded or embellished when there is no clear specifica- 
tion of the identities of the independent variables. The first of the 
symbols 

ðA 


; miai 
y ðx 


ad 
Ox 


z=0, y=0 
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can tell us that 4 is a function of the independent variables x and y, 
the “fixed variables” being displayed at the bottom of the vertical line. 
The second symbol can tell us that the independent variables are x and 
y and that the partial derivative with respect to x is to be evaluated 
at the place for which x = y = 0. No sane person will bother with the 
embellishments in a very long calculation in which du/dx and du/dy 
must be written many times and their meanings are perfectly clear, but 
the embellishments are available when they are needed. 

Without introducing hordes of symbols for functions, we look briefly 
at applications of the chain rule in situations in which w is a function of 
x, y, z and, in addition, x, y, z are functions of a, $, y. The assumption 
following (11.31) applies here, and accordingly u is also a function of 
a, B, y. While such agreements are usually made without explicit men- 
tion, we solemnly proclaim that whenever the partial derivative with 
respect to a Roman coordinate x or y or z (or a Greek coordinate & or 
Bor y) appears, the thing being differentiated must be a function of those 
coordinates and the other coordinates of the same nationality are the 
“fixed variables.” If we keep 8 and y so rigidly fixed that it is unneces- 
sary to take this fact into account in our notation, then u is a function 
of x, y, z and x, y, z are functions of a and the version of the chain rule in 
Theorem 11.24 enables us to write 


du _ ðu dx , du dy Ou dz 
da @xda' dyda ` az da 


When we use partial derivative notation to convey the information that 
8 and y are fixed, we obtain the first of the formulas 


ðu _ dude ðu ðy , du'dx 

da ax ða Oy 0a = =dz Oa 

ðu dudx , dudy , du Oz 
11.3 = — 
(11.36) ôB dx əðß tyy dy dB tgz az 36 

ðu — ôu ôx , du Oy , Ou Oz 

dy  ðxðy dy Oy ` az Oy 
and the next two are obtained by similar processes. If, as usually hap- 
pens in applications, we confine attention to regions over which the equa- 
tions giving x, y, z in terms of a, 8, y can be solved to give a, $, y in terms 
of x, y, z then the same procedure gives the equations 


ðu Ouda , duOB , du dy 
ax dadx | OB ax | dy dx 
(11.37) Ou _ uda , du dB , du dy 
ðy da ay OB dy = ay OY 
ðu duda , du Op , du dy 
ðz dadz ` OB dz + ðy Oz, 
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Teachers and textbooks providing instruction in more advanced mathe- 
matics like to presume that their disciples know enough about functions 
and partial derivatives to be able to write the equations (11.36) and 
(11.37). We are not, at the present time, required to comprehend the 
various reasons why these equations are important, but we can make an 
observation. In case the first of the conditions 


de ðy as ða aB ay! 

ða da da! Ox Ox Boe | 

Ox Oy az! ida dB dy 
11.38 — — — <0 — = 0 
(11.38) a6 8 EE ayl~% 

ae ay as de aB ay! 

dy oy ðy dz dz Oz! 


is satisfied, the system (11.36) of equations can be solved to obtain 
formulas expressing du/dx, du/dy, Ou/dz in terms of du/da, du/dB, 
du/dy. Similarly, if the second condition in (11.38) is satisfied, then 
the system (11.37) of equations can be solved to obtain formulas express- 
_ing du/da, ðu/ðß, du/dy in terms of du/dx, du/dy, ðu/ðz. The determi- 
nants in (11.38) are called functional determinants or Jacobi determinants 
or Jacobtans, and those who are destined to encounter them later may 
become thankful for this preliminary glimpse of them. Those accus- 
tomed to use of matrices prefer to see the systems (11.36) and (11.37) 
of equations written in the forms 


Ou Ox oy Oz) [ðu Ou da OB ðyİ |ðu 
ða da da dal | ax ðx ax ax ax| |ða 
aul _ [ax ay as] Jou foul _ faa a8 ay] Jou 
eg} a8 ə apl jay ay| lay ay dy| lap 
Ou Ox dy ðz| jdu ðu da OB Gy} jou 
dy) lay ay ay) la az} lax az az) lay 


This permits use of the algebra of matrices to which an introduction was 
given at the end of Section 2.5. 

In the following problems, it is assumed that each of the given func- 
tions has all of the derivatives and partial derivatives we want to use 
and that these derivatives are continuous. 


Problems 11.39 
1 Assume that x, y, z are functions of a, B, y such that 
Fi(x,y,2,0,8,7) = 0 Fa(x,9,2,0,8,Y) = 0 f3(x,¥,%,0,8,7) = 0. 


Write the equations obtained by taking partial derivatives with respect to a 
and then find the condition under which these equations uniquely determine 
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ðx/ða, ðy/ða, Oz/Oa in terms of partial derivatives of fı, fo, fs. Partial ans.: 
The first equation can be put in the form 


fi ax | af Oy , Of ae _ ðh 
Ox ða Oy da dz da Oa 


and the required condition is 


ah af af 
Ox Oy Oz 
af fs ahlo 
Ox Oy Oz 
afs ðf af 
‘ax ay az 


2 Supposing that x and y are differentiable functions of a and 6 for which 


2x? + 3y — 2a? — 38 =0 
x? + 2y3— a — 26? =0, 


calculate 0x/da and 0v/da. Ans.: 
Ox  8ay? — 1 dy 2 — 4a 


da 8xy? — 2x ða 12y?—3 
3 Supposing that p > 0 and that p and @¢ are functions of x and y for which 
p cos @ = x, p sin gd = y, 


differentiate with respect to x and then with respect to y to obtain 


SP cos $ — p sin ġ SÊ t- , Že cos 6 — p sin $S = 0 
Ox Oy 
Be ing + pcos 20 0, d in b+ poo 9 38 = 


and solve for the derivatives to obtain 


ðP _ cos d, ðP _ sin ¢, 


ð$ _ _ sing ð$ _ cos d 
Ox dy 7 p 


Ox p dy p 


4 Copy the formulas at the conclusion of Problem 3 and use them in appro- 
priate places in the process of deriving the following formulas that are used to 
make transformations from rectangular to polar and cylindrical coordinates. 
Supposing that v is a function of x and y and that x and y are functions of p and 


@ for which 


(1) x=pcos¢, y=psin ¢, 
show that 
(2) Ou _ du dp | du ð$ 

Ox Opdx ðe 0x 


dy ~ Op dy ` ð$ dy 
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and hence 
Ou _ ðu du sin ¢ 
(4) dx dp °°? — 36 7 
ðu _ ðu . ðu cos $ 


Then show that the formulas 


AIEA ($)] 2 9 (du\) 9 
Ox? Op \Ox/ 1 Ox E (sz) | Ox 
Bu TÈ (2Y) 36 + 1 2 (29) 38 
dy? Op \ðy/ 3 dy Lð \dy/ d dy 
can be put in the forms 
ðu _ Ou 4, 4 9% sin @ cos ọ 
(6) amt Bp? cos? @ > aad 3 p 
ðu sin?d , Ou sin? ọ ðu sin @ cos ¢ 
+ opp tap p t? dd P 
ðu _ ĝu ð?u sin $ cos ¢ 
(7) ay? 7 Jp sin? @ + 2 3sðg p 
ðu cos? @ , ð?u cos? ¢ ðu sin @ cos d 
+ + —, — 2 = —— 
ðp p ð$? p? d¢ p? 


Show finally that 
ðu , Ou _ ðu , Lldu , 1 ðu 


(8) ax? + Gy? ap t p dp | pide? 
and hence 

ðu , u , Ou Ou , 1 Ou 1 ðu , 0% 
©) ai t aya t a ~ apt pap | phage! oat 


For use in the next remark, we note also the formula 


(10) 1 ðu _ 1 du cosm@ Ou 


yoy pôõp p®sin ¢ ð$ 
which comes from (5) and (1). 

5 This remark can be very helpful to those who will study brands of physics 
and engineering in which the Laplace equation, the heat equation, and the wave 
equation appear. While the operation might be tedious and need not be per- 
formed, we could copy all of Problem 4 with x, y, p, and @ respectively replaced 
by z, p, r, and 0. This shows us that if u is a function of z and p and if 


(1) z = r cos 6, p = rsin 8, 


ðu , Ou Om , l ðu , 1 8u 

TAT E TEE ET 

3) Lou lôu, cosd ðu 
pôp rðr fr sin 6 06 
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These formulas and the simple formula 


Late 1 
(4) pag? ~ Pain? 60? 


enable us to transform important expressions from cylindrical coordinates p, ¢, z 
to spherical coordinates r, ¢, 8. To put fundamental consequences of our results 
in the compact form which is very often used, we define V*u (read del squared u), 
the Laplacian of u, by the first of the formulas 


(5) V2u = a3 


u = Bp t pap | ptagt | de 
ð?u 2ðu , 1 Ou cos 0 Ou 1 ð?u 
(7) Viu = 972 var T 72962 * sin 0 00 T isin? 6 ag? 


Then (5) gives V2u in terms of rectangular coordinates x, y, z. As Problem 4 
showed, (6) gives V2u in terms of the cylindrical coordinates p, ¢, z of Figure 
10.11. As we see by adding (2), (3), and (4), the formula (7) gives V?u in terms 
of the spherical coordinates r, œ, 0 of Figure 10.12. 

6 The results of Problems 3, 4, and 5 are important. Instead of proposing 
that similar but less important problems be solved, the author suggests that 
these problems be solved again and again. 

7 This long problem involves a theorem which is called an implicit function 
theorem. The equation 


(1) x3 + xy + y? — 31 =0, 


which happens to be satisfied when x = 3 and y = 1, provides an introduction 
to the subject. If we know that y is a differentiable function of x for which (1) 
holds, then we can differentiate with respect to x to obtain 


(2) 3x? + a 2 ? 4 y + 3yeD = 0 
or 
dy 
(3) (3x? + y) + («+ 3y?) S 
and hence 
3 2 
(4) dy _ x? + y 


ax x + 3y? 


provided x + 3y2 = 0. We can be pleased by our abilities to calculate deriva- 
tives of differentiable functions, but we can also be irked and frustrated when we 
realize that we do not know whether there is a differentiable function f for which 


(5) x + xf(x) + U) — 31 = 0. 


With the aid of partial derivatives, we can obtain very satisfying information 
about hordes of problems of which the one considered above is a special case. 
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(6) Implicit function theorem Let G be a given function of two variables 
x and y such that G(xo,y0) = O and, moreover, the function G and its two partial 
derivatives Gz and Gy are continuous and G,(x,y) = O over some rectangular region 
R, having its center at the point (xo,yo). Then, for some positive number h, there 
is one and only one function f, defined and differentiable over the interval x» —h < 
x < xo + h, for which f(xo) = yo and 


(7) G(x, f(x)) = 0 (xo — h < x < xo + A). 


Moreover, when xo — h < x < xo + h and f(x) = y, f'(x) and A can be obtained 


by differentiating (7) or 


(8) G(x,y) = 0 
to obtain 
(9) Gz(x, f(x)) + Gy(x, f(x)) (e) = 0 
or 
(10) StI. 
and solving to obtain 
m ro- -RER 
or 
ðG 
dy 


Before proving the theorem, we observe that it provides very solid information 
about the graph of the equation G(x,y) = 0; the graph must contain a curve C 
which contains the point Po(xo,yo), and this curve C has a tangent at the point 
(xoyo) which has slope f’(xo). As sometimes happens in other cases, our proof 
of the theorem reveals some facts that are not stated in the conclusion of the 
theorem. To prove the theorem, we suppose that G,(xo,yo) > 0; in case Gy(x0,¥o) 
< 0, the proof is similar. Then the hypothesis that G,(x,y) = 0 over Rı and the 
intermediate-value theorem imply that G,(x,y) > 0 over Rı. Choose numbers 
yı and yz such that yı < yo < yo and R; contains the line segment consisting of 
points (xoy) for which yı S y S ye. Since Gy(xo,y) > 0 when yı S y S Yz it 
follows that G(xo,y) is an increasing function of y over the interval yı S y S y2 
But G(xo,yo) = 0, and therefore 


(13) G(x0,y1) < 0 < G(xo,y2). 


Since G is continuous, we can choose a positive number A such that the rectangu- 
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lar region R of Figure 11.391 is a subset of the given rectangular region R, and, 
moreover, 


(14) G(x,y1) < 0 (x —h<x < xo +h) 
(15) G(x,v2) > 0 (xo — h <x <x +h). 


The negative and positive signs in Figure 11.391 serve to remind us that G(x,y) < 
0 at points (x,y) on the lower edge of R and that G(x,y) > 0 at points (x,y) on 
the upper edge of R. We are now pre- 

ared to obtain results. Let xọ— h < 

Co =t Y») o Y») Coth 3a) i < xo +t h. Since G,(x,y) > 0 when 
yı Sy £ ya we conclude that G(x,y) is 
increasing over the interval yı S y S yo. 
Since G(x,yı) < 0 and G(x,y2) > 0, we 
conclude, with the aid of the intermedi- 


Sy oya) mth,y,)  ate-value theorem, that there is one 
(% - 2, y1) oN (xo “and only one number f(x) for which 
Figure 11.391 yı < f(x) < y2 and G(x, f(x)) = 0. 


Our proof is now about half done; we 
have found our f but we need information about f'(x). To start getting this 


information, suppose that x» — h < x < xo + h and xo — h < x + Ax < xo +h. 
Then 


(16) G(x, f(x)) =0, G(x + Ax, f(x + Ax)) = 0 
and consequently 
(17) [G(x + Ax, f(x + Ax)) — G(x, f(x + Ax))] 
+ [G(x, f + Ax)) — GG, f(x))] = 0. 


Applying the mean-value theorem (Theorem 5.52) to these differences shows that 


there exists a number £ between x and x + Ax and a number ņ between f(x) 
and f(x + Ax) such that 


(18) G.(E, f(x + Ax)) Ax + G(x mU + Ax) — f(x] = 0 
and hence 
(19) f(a + Ax) — f(x) _ _ Ge(& fe + Ax) 

Ax 7 Gy (xn) 


Since G, and G, are, by hypothesis, continuous at the point (x, f(x)), the desired 
result (11) will follow from this if we show that f is continuous at x, that is, 


(20) lim f(x + Ax) = f(z). 
Az>0 


To prove (20), let e be a positive number for which f(x) + € and f(x) — e lie 


between yı and y} Since G(x, f(x)) = 0 and G,(x,y) is an increasing function 
of y, we find that 


G(x, f(x) — €) < 0 < G(, f(x) + 6). 
Since G is continuous, we can choose a positive number 6 such that 


G(x + Ax, f(x) — €) < 0 < G(x + Ax, f(x) + ©) 
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whenever |Ax| < ô. Since G(x + Ax, y) is an increasing function of y and 
G(x + Ax, f(x + Ax)) = 0, it follows that 


f(x) — e < f(x + Ax) < f(x) + e 


whenever |Ax| < 6. This establishes (20) and our theorem is proved. Since f 
is continuous over the interval x» — h < x < x + h, the graph of y = f(x) 
over this interval is a curve C. Moreover, our proof shows that this curve C is 
the only part of the graph of the equation F (x,y) = 0 which lies inside the rec- 
tangular region R shown in Figure 11.391. 

8 This is another long problem. State and prove a theorem, similar to that 
of the preceding problem, in which it is assumed that G.(x,y) Æ 0 and the con- 
clusion involves a function ¢ for which G(¢(y), y) = 0. 

9 Prove that if Gis a function of x and y such that G, Ga, and G, are every- 
where continuous, and if (xo,yo) is a point on the graph I of the equation G(x,v) = 
O for which Gz(%0,yo) and Gy(xo,yo) are not both O, then the point (xo,yo) is a 
“simple point” on the graph. Remark: This is a theorem in geometry. The 
conclusion means that if R is a rectangular (or circular) region which has its 
center at (xo,yo) and which has a sufficiently small diameter, then the points of 
T that lie in R can be ordered in such a way that they constitute a simple curve 
or a Jordan arc; see the last two of Problems 7.19. The theorem implies that 
multiple points and isolated points of I can occur only at places where Gz and 
G, are both zero. 


10 For each of the equations 


(a) xy = 0 (b) x? + y?2 =0 
(c) y?— x? =0 (d) yi — x(x —1) = 0 
(e) y? — x*(x2 +1) =0 


the preceding problem allows the possibility that the origin may be an isolated 
point or a multiple point. What are the facts? 
11 Let 


u = [œ — 1)? + O- Yt + y? +3) 


and observe that the graph in the xy plane of the equation u = 0 contains only 
one point Pı. Observe that this result and Problem 9 imply that the two first- 
order partial derivatives of u at Pı must be zero. Calculate these derivatives and 
show that it is so. 

12 Let f be a vector-to-scalar function for which f(r) is a number (or scalar) 
whenever r is a vector in the domain of f. We do not need a coordinate system to 
define a number D(f, r, u) by the formula 


0) DU, 1, u) = lim HEF) AO) 


whenever r and u are vectors such that the limit exists. In case u is a unit 
vector, (1) provides an intrinsic definition of the derivative of f at r in the direction 
ofu. To start acquaintance with this matter, introduce a coordinate system 
and notation such that 


(2) r=xi+yj +k, fr) = f(x, y, 2) 
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and hence f(r + hu) = f(x + hui, y + huz z + hus) whenu = mi + uj + ugk. 


Then, assuming that f has continuous partial derivatives, prove that 
(3) D(f, r,u) = Vf-u. 


Observe that the scalar components of the vectors r, u, and Vf depend upon the 
coordinating system which was chosen, but that r, u, D(f, r, u) and Vf do not. 
Remark: The intrinsic definition (1) is particularly convenient when “gencral” 
or “‘abstract’”’ theories are being developed. In fact some parts of modern 
mathematics involve “‘vectors” that are “abstract elements” of “abstract spaces” 
for which appropriate axioms are valid. Some elegant theories are developed 
without use of coordinate systems. At the other extreme, some elementary 
developments of vectors in E; are tied so rigidly to a single sublime coordinate 
system that vectors are identified with ordered sets of numbers (the scalar com- 
ponents of the vectors). One virtue of (1) lies in the fact that it can be used 
when f is a vector-to-vector function of which the domain and range are both sets 
of vectors. Nobody ever learns all about all of these things on a windy Wednes- 
‘day, but people who keep studying mathematics do keep picking up ideas. 


I 2 Series 


12.1 Definitions and basic theorems Even though we have already 
had experiences with series, we start ab initio to develop the subject. 
An array of numbers and plus signs of the form 


(12.11) Uy + Uo + U3 + e.. o 

is called a simple infinite series or simply a series. The numbers s1, 52, 

53, © © * defined by sı = uy, 52 = uy + we, 53 = Uy + uz + uz, * * + and, 

in general, 

(12.12) Sn = }, Ue (n = 1, 2,3, +>), 
k=1 


constitute the sequence of partial sums of the series. If it happens that 


(12.121) lim Sn = 5, 
n— 0 
then the series is said to be convergent and is said to converge tos. If the 
limit does not exist, the series is said to be divergent. A series which 
converges to s is not the number s, just as a hand that contains 5 fingers 
587 
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is not the number 5. Nevertheless, we find it very convenient to abbre- 
viate the statement that the series converges to s by writing one or the 
other of 


(12.13) s = uit u: +t ust t’, s= Ý u. 
k=1 


The significance of this matter is usually not fully comprehended by 
unfortunate people who have not digested the contents of Problems 
6 and 7 of Problems 5.69. For present purposes, it is important to 
recognize that the equality signs in (12.13) do not have meanings like 
those of the equality signs in ordinary arithmetic and algebra, and that 
we cannot get s by ‘adding up” all of the terms of the series. Passage 
of time may possibly bring extinction to the habit of calling s the sum 
of the series. The trouble is that the habit makes the theory of series 
seem too easy for quick-witted superficial people, and, at the same time, 
seem too mysterious and difficult for everyone else. Keeping the impor- 
tance of (12.121) constantly in mind enables us to make rapid progress 
with the elementary theory of series. 
There are numerous reasons why the geometric series 


(12.131) atar ar +at+.:..> 


is important in advanced as well as elementary mathematics. We 
should always be well aware of the fact that ifs, is the sum of n terms 
of this series, then, when r = 1, 


(12.132) Sn e=altrt+rPr+--- +r) =a : —. 

If |r| < 1, then lim 7” = 0, so lim s, = a/(1 — r) and hence 

(12.133) j £ -= a+ ar+arr+arr+--- (r| < 1). 
If |r| = 1, the series diverges. 


The series 
(12.134) 1-14+1-1+1-14+1-1+::-;, 


in which the terms are alternately +1 and —1, has partial sums 


(12.135) 1,0,1,0,1,0, 1,0, -- - 
which are alternately 1 and 0, and we start cultivating a good habit by 
plotting the points sı = 1, s = 0, s3 = 1, 54 =0,-- - as in Figure 


12.136. There is clearly no s such that lim sa = s, and therefore the 


Figure 12.136 


O e—a 
Sw Sp Sg°** Sis $3, S5°** 
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series is divergent. A little thought about this matter can lead us to the 
idea that a series Zu, cannot be convergent unless un —> Qas n— x. To 
prove that the idea is correct, suppose Zu, converges tos. Then sn — s 
and Sn-1—> $ aS n— © and, since Un = Sn — Sn_s, it follows that un — 0 
as n—> ©. Several ideas can be obtained by investigating the series 


(12.137) $+$-4-4F-F4+d4+44+44+4-4-4-3 
1 1 
e n a 


and its sequence of partial sums. The nth term of the series approaches 
0 as n— œ. As we plot the points s1, 50, 53, 54, © © © , we find ourselves 
hopping to and fro between 0 and 1. The sequence of partial sums is 
bounded, but the series is not convergent. The possibility of learning 
about convergence of series will be enhanced if we obtain a full apprecia- 
tion of the way in which the following fundamental theorem is proved. 


Theorem 12.14 If 


(12.141) s= u t uzp u; t:> 
and 
(12.142) t=ntntate:: 


and if a and b are constants, then 
(12.143) as + bt = (au, + bm) + (aua + doe) + (aus + bo) + ° °°. 
To prove this theorem let, for each n = 1,2,3,--°, 
Sn = urt ut te Hun th Mtoe bt to 


Rules of arithmetic (or possibly algebra) allow us to multiply by a and b, 
respectively, and add the results to obtain 


asn + btn = (au, + bo) + (auz + bv) + + °° + (aun + don). 


Thus as, + bt, is the sum of n terms of the series in (12.143), and to 
prove (12.143), it is necessary to prove that 

(12.144) lim (asn + btn) = as + bt. 

It is now very easy to see how to finish the proof. The hypothesis 
(12.141) means that the first of the formulas 

(12.145) lim Sn = S, lim th =f 

holds, and the hypothesis (12.142) means that the second holds. Finally 


(12.145) implies (12.144) and the proof is finished. The above theorem 
and the next two lie at the foundation of the theory of series. 
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Theorem 12.15 Jf Zu, is a series of nonnegative terms, so that uy =0 


for each k, and if the sequence sı, 52, °° > of partial sums is bounded, so 
that, for some constant M, 
(12.151) Sn = urt ust >te tu SM (n =1,2,3,-- 5), 


then Du, converges. 


This theorem was proved in Section 5.6, but the theorem is so important 
that we think about it some more. The hypothesis that u, 2 0, uz 2 0, 


u3 2 0, -+ + implies that sı =u 20, s2 = s1 + Uz 251, 53 = Sg + 
us 2 $e, and, in fact, that 


(12.152) OSs,Se855;85 °°: S55M 


for each n = 1,2,3, -+ . Since the set # consisting of the numbers 
51, 52, 53, © © * is nonempty and has an upper bound, it must, according 
to Theorem 5.46, have a least upper bound which we can calls. Then 
Sn X s for each n. Let e> 0. There must be an index N for which 
sy > s — e, since otherwise s — e would be an upper bound of the set of 


numbers sı, 52, 53, © ` © and s would not be the least upper bound. It 
follows from (12.152) that 


(12.153) s-eXs,85 (n > N), 


and hence that lim sn = s. Therefore Zu, converges to s and Theorem 
12.15 is proved. In case the terms of a series Zu, are all positive and 
Sn S M for each n, the figure obtained by plotting the partial sums sy, 
S2, 53, © © * , the upper bound M, and the number s to which Zu; con- 
verges must look essentially like Figure 12.154. On the other hand, if 


0 Si Se S3 S4 s M 
Figure 12.154 


Du, is a series of nonnegative terms for which the sequence of partial 
sums is not bounded, then s, —> œ as n— œ and the series diverges. 


For series of nonnegative terms, and for such series only, it is convenient 
to use the first of the formulas 


Dug < ©, Zuk = © 


to abbreviate the statement that the series is convergent and to use the 
second of the formulas to abbreviate the statement that the series is 


divergent. In particular, a series Du, is said to converge absolutely, or 
to be absolutely convergent, if Z|ur| < œ. 


Before stating the next theorem, we look at the two series 


ao + aw + ax? + axt cee, Llo+xtetoites- 
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in which it is supposed that x is a positive number and ao, a, a2, °° ` 
are numbers, not necessarily positive, for which |a,| < 1 for each k = 
0, 1,2,- . Employing the admirable terminology of the following 
definition, we can see that the first of the two series is dominated by the 
second. 

Definition 12.16 The series u+ uz+ uz +--+: is said to be 
dominated by the series Mı + Ma + Mz + -> -if |u| S My for each 
kh = 1, 2, e., 

Observe that the terms of a dominating series must be nonnegative; 
the inequality |u| S Mı can never be satisfied unless M, 2 0. The 
following theorem is known as the comparison test for convergent series; 
it tells us that we can be sure that a given series is convergent if we can 
find a convergent series that dominates it. 

Theorem 12.17 (comparison test) Ifthe series ui + uz + ust =--> 
is dominated by a convergent series Mı + Ma + Mz + -> , then the 
dominated series must be convergent and, moreover, must be absolutely 
convergent. 

This theorem assures us, in slightly different words, that a given series 
must be convergent if we can find a series of bigger fellows that is con- 
vergent. For the case in which the terms are nonnegative, it assures us 
that a given series must be divergent if we can find a series of smaller 
fellows that is divergent. Our proof of the theorem depends upon use 
of two series 2p, and Zq, with terms defined by 


= |ux| — ur 


_ | zc, + Ux 
(12.171) pi = =, > 


that are useful for other purposes. Observe that p = uz when u 2 O, 
that p+ = 0 when up <S 0, and that 


0 S pr S |u S Mi 


in each case. Observe also that gą = 0 when u, 20, that g = — ur 
when u <S 0, and that 
0O S gr S |u| S Mk 


in each case. Observe finally that adding and subtracting the formulas 


in (12.171) gives 
(12.172) Uk = Pk — qk, lur| = be + gr- 


Letting M be the number to which the series 2M, of nonnegative terms 
converges, we see that 


Pit poet |- + bpn S Mit M: + .. * + Ma 
qı + q2 + <-> +g 8 Mit M: + +>- +M 
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It follows from Theorem 12.15 that the series 2p, and Èg are both con- 
vergent. It then follows from Theorem 12.14 that the series 2(p, — g) 
and Z(px + gx) are both convergent and hence that the series Dux and 
Z|uz| are both convergent. This completes the proof of Theorem 12.17, 

Let Zu, be a series that converges absolutely so that the series 2|1,| is 
convergent. Since the series Zu, is dominated by Z]|u,|, an application 
of Theorem 12.17 (the comparison test) gives the following nontrivial 
theorem. 

Theorem 12.18 If a series converges absolutely, then it converges. 

From the point of view of ordinary elementary mathematics, abso- 
lutely convergent series are the ones most easily manipulated. Series 
that converge but do not converge absolutely are quite respectable but 
can be troublesome. In this course, we learn relatively little about 
divergent series and (except for a brief excursion in Problems 5.69) we 
never assign values to them. From our present point of view, the asser- 
tion “Su, = œ” does not mean that the series has a value; it means that 
the terms are nonnegative and that the series has partial sums s1, 50, 53, 
- + e for which lim s, = œ. 


n—> æ% 


Problems 12.19 
I Tell the meaning of the statement 
0=0+0+4+0+04: 


and prove the statement. 
2 Using the approximations 


log 2 = 0.693,  qr/4 = 0.785, ¢=2.71, e= 0.37, 


draw the interval 0 S x < 1 on a rather large scale and mark the points whose 
coordinates are the partial sums 5), 52, © > * of the series in the formula 


(2) 0=0+0+04+0+0+4+--- 
()l=st+et+etoetat-:: 

(c) log2=1-—-4+3-E4+3-44+ °°" 
(@)r/4=1l-~g+3—-F+5-trt+::: 


1 l 1 1 l l 
(e) -2=y+yt+gt+tatataAtT 
l 1 l 1 1 
“Ponta ata at 
3 Sketch figures indicating the natures of the partial sums of the series 

G@)etetetetstetaetat+at+etetetet+etit 
(6)1—-2+3-44+5-647-84+-:- 
()14+0424+04+34+04+44+04::- 


(@)1-1+2~-24+3-344-44--- 
¢)Q€-N+2@-2+3-34+4-4+--- 
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4 Considering separately the cases in which x = 0,0 < x <1, and —1 < 
x < 0, use one or the other or both of the formulas 


_ ent 1 


y =] +++: t, poy lteter 


to determine the nature of the sequence of partial sums of the geometric series. 
5 Prove that if |a| < 1 for each k, then the series 


ao + ax + ax? + agx? to 


converges when |x| < 1. Hint: Use a dominating series. 


6 The preceding problem is important and must be thoroughly understood. 
Tell what is meant by the statement that the series 


(1) ao + aix + ax? + ax3 t ->.> 
is dominated by the series 
(2) 1+ |x| + |x]? + lft ---. 


Tell why the series (2) converges when |x| < 1. Give a full statement of the 
comparison test for convergence of series. Remark: We should hear very often 
that (1) is called a power series, that the numbers ap, ai, az . . . are called 
constants, and that the number x is called a variable. We should not, however, 
allow the terminology to interfere with our understanding of Problem 5. 

7 Prove that if 4 and p are positive constants for which 


| a,,| < A p* (k = 0, 1, 2, 3, °° -) 
then the series 


ao + axx + ax? + ax? + °°: 


converges when |x| < 1/p. Solution: In the stated circumstances |ax*| S 
A|px|* and the series is dominated by the convergent series in 


A 


_ #7 = 2 Sto. ee. 
= [px] A + Alpx| + Alpx|? + Alpx|? + 


8 Prove that if the series 


Co + eye + cox? + cP + 


converges when x = xo, then it also converges when |x| < |xo|. Solution: This 


problem is much like the preceding one. The hypothesis implies that jim CHG 


= 0 and hence that there is a constant M for which 


laxt| £ M k=0,1,2 >) 
If |x| < |xol, then 
|cpx*| S M\x/xo|* (k =0,1,2,-- -) 


and the series is dominated by a convergent geometric series. 
9 Show that 
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by showing that if s„ is the sum of n (that is, the first n) terms of the series, then 


1 
=0-Ð+6-Ð+6-Ð+ +l) 


Note that the middle sum is a telescopic sum. 
10 With the aid of a comparison of the two series 


1 1 1 1 1 1 
pratataret O z4 tgs t56 tE7t78Tt 
prove the first of the formulas 
al 1 1 61 © 1 oT? 
» p< 7 +t = 3a, > nm 6 
The second formula is a simple consequence of basic theory of series known as 
Fourier series. The result we have obtained is significant because 7? is about 
2 
10 and z is about 442 or $9. 
11 The first of the series in 


1 1 l 1 1 l 1 
1573 t35 t37t75t 77 and Lay t733 tag tst 


converges because it is dominated by the second and the second is convergent. 
Show that 0 < q < 1 and, if possible, find g. 

12 Prove that if a, 2 0 and Ba, < œ, then Daz < œ. 

13 This is a preliminary skirmish with the karmonic series 


(1) ltetaetetetet+etet ete. 


This series is divergent because its terms and partial sums are greater than or 
equal to those of the divergent series 


(2) l+etatetetetetetiet::: 
of positive terms. Let H(1), H(2), H(3), - - - denote the partial sums of the 
harmonic series and, with the aid of (2), show that H(2°) = 1, H(21) = 3, 
H (2?) > $, H(2) > $, and, in general, 

H(2") > (n + 2)/2. 


14 Note that the series in 


1 1 1 1 
123 t234 t gps tage t cept 


is dominated by the series in 
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and hence must be convergent. Try to find simple reasons why ¢<5s <4. 
If more time is available, show that s = 4. Remark: Proof that s = Ł can be 
based upon the identity 


1 1 1(@+1)-(—-1) 


-12$ +1) p(p?-1) 2 pp? = 1) 
i loss _ sol 
2L(p—1)p plo +1) 
15 With the possibility of using consequences of the facts that 
l+e+atit::: = 0, 
1 l l 1 
T2to3tgqgtgpe tcc =l, 
l 


l—x 


Itete teto (l| < 1). 


when these things are helpful, tell whether and why the following series are con- 
vergent or divergent: 


2 4 8 16 
@itmitpteatTet+t: 


2 3 4 5 
Olt+petpetetyat i: 
0? 12 22 32 42 
(9) gaz t geza t gag t petit 
1 1 1 1 


a) Itt 


sinx  sin2x — sin 3x | sin 4x sin 5x 
k) a t-a toga top tog 


Hint: The comparison test is important. Seek a convergent series that dominates 
your series (so you will know your series is convergent) or seek a divergent series 
which your series dominates (so you will know that your series cannot be con- 
vergent and must be divergent). 

16 Prove that 


9 (el 


2 5 luzo] < 5 2, {2 + 2 |o]? 
k=] 


k=] k= 
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whenever the series on the right are convergent. Solution: The inequality 


OS (uel — leak)? = eal? — 2] ual lel + forl? 
implies that 


2| wel o] S Jul + fol? 
and hence 


n Nn n 
2 $ hallal SD ul? + $ la s U +P, 
k=1 k=1 k=1 


where U and F are the numbers to which S|x,|? and =|9,|? converge. The resule 
follows. 


17 Imagine that a coin is tossed repeatedly and that we let a, = 1 if the kth 


toss produces a head and let x, = 0 if the &th toss produces a tail. Tell why 
the series in 


Xi x9 X3 
x = 71 + 72 + 73 + . o o 
must be convergent to a number x for which O < x < 1. Show that 


1 1 1 
X17" F x2—- F X37 F 
21 + 22 + 23 


1 p- 
x-53 


and 


1 2x1 — 1 2x2 — 1 2x3 — 1 
i i z top toy t 


Remark: This problem can steer our thoughts toward the Rademacher functions 
ri(t), re(t), © © > for which 


ONEONE ON 
i = 2 tae ts t 


when0O Si Sl. Figures 12.191, 12.192, and 12.193 exhibit graphs of the first 


y y 
1 1 
O 1 x O 1 x 
-1 
Figure 12.191 Figure 12.192 Figure 12.193 


three Rademacher functions. These things are important in the theory of 
probability and elsewhere. 


18 Look briefly at the following outline of a proof that e is irrational and then, 
with the textbook out of sight, write a proof in which more details are given. If 
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we suppose that e = m/n, where m and n are integers, we can suppose that 
n > 0 and put x = 1 in the formula 

z=] x? go yi 

e7 = ts +z +3 tg t a 
to obtain 


m _ 1, 1 1 
a a a a tat +t Dta@popit’ 
and then multiply by z! to obtain the formula 


1 
M = N+o99+ @Epeen tweneepegnt: a, 
where M and N are integers. Thus the quantity Q defined by 


S r S 
ERTSE ES CEPET I C EE ts + 


is the diference of two integers and must therefore be an integer. But 


1 
stit@ent wait: 


si 


n > n+1 << 
so Q cannot be an integer. 

19 Give a reasonable definition setting forth conditions under which a given 
series 


VitVeTVst °°: 


of vectors in £3 is said to be convergent. Show that the series will be convergent 
if the series 
[vi] + [ve] + [val + - °° 
is convergent. 
20 This problem involves rearrangements of series of nonnegative terms. 
Let u, 2 0 for each & and let 


s = Uy + uz + uz H ut s 


Let mı, mo, m3, > > * be a sequence of positive integers, not necessarily in their 
natural order, in which each positive integer appears exactly once. Prove that 


S = Um + Um + Um, + Um °° 
Hint: Let 


tn = Um, F Um + 78 Hb Um, 


Show that t, < s and that if e > 0, then t, > s — e whenever n is sufficiently 
great. 

21 This problem has a preamble. To pour acid upon the idea that each 
rearrangement of the series 


(1) 1-$4h-$44-$44-h4--- 
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must converge to log 2, we enter the construction business. Let Q1, Qo, Qs, - 

be a sequence of numbers. Let u(1), u(2), ©- - , u(n) be, in order, the first 
positive term of (1) together with just as many of the following positive terms as 
are necessary to obtain a cumulated sum s(nı) which eaceeds Qı. Let u(m; + 1), 
u(nı + 2), °° +, u(n2) be the first negative term of (1) together with just as 
many of the following negative terms as are necesary to obtain a cumulated 
sum s(m2) less than Qo Let u(me+1), u(n +2), °° >, u(n3) be the first 
unused positive term of (1) together with just as many of the following positive 
terms as are necessary to obtain a cumulated sum s(m3) which exceeds Q3. Let 
u(n + 1), u(n3 + 2), >> >, u(n4) be the first unused negative term of (1) 
together with just as many of the following negative terms as are necessary to 
obtain a cumulated sum s(n4) less than Q4, and then continue the process. Now 
comes the problem. Give precise information about the series u(1) + u(2) + 


u(3) + - - - and its sequence s(1), 5(2), 5(3), - -© - of partial sums when, for each 
nonnegative integer m, 


(a) Qm = 416, (2) Qm = Q, (c) Qm = 107 

(d) Qm = —10” (e) Qm = (—1)” (F) Qm = (—10)” 

22 Show that the first of the formulas 
lg2=1-4+4-ł+3-8+ł}-3 +5- trr- i+. 
žlog2=0+2+0-7+0+8t0- } +0+5 +0 -—+: 
slog2=14+0+3—-$4+5+0+7-4 +4+0 +4-4 +. 
glog2=1+3-—at+st+7-tt+5+y—-stirstis—-s +: 


implies the remaining ones. What, if any, new ideas appear in this problem? 
23 Let 0 <A<1. Suppose that a steel ball dropped from height A hits 
a steel plate 1/2/16 seconds later and immediately (without wasting time com- 
pressing and then expanding to reverse its direction) starts to rebound to height 
Ah to begin a similar bounce. Suppose that the ball continues to bounce in this 
way. Find the total distance D traveled by the bouncing ball. Find also the 


total time T, discovering that the small bounces occur so rapidly that the ball 
does not bounce forever. Ans.: 


D= Là pa MALEVA 


1—n ES 1 — VA 
24 A sequence xı, %2, x3, © © -is called a Cauchy sequence if 
(1) lim (xm — xn) = 0, 
™m,n— 0 


that is, if to each positive number e there corresponds a number P such that 


(2) \xm — xn] < € (m,n > P). 


It is easy to prove that each convergent sequence is a Cauchy sequence. Suppose 


(3) lim x, = L. 


R 0 
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Let e > 0. Choose an index P such that 
(4) [tn — L| < €/2 (n > P) 


Then, when both m and n exceed P, 


(5) |%m — Xal = [Gm — L) — (xn — L)| S lem — L| + |en — LI 
<¢/2+ €/2 = e, 


and the conclusion follows. Itis also true that each Cauchy sequence is convergent 
Proofs of this fact are much more difficult because they must, in one way or 
another, use completeness of the real-number system to produce the number to 
which the sequence converges. One lively proof starts with the choice of an 


increasing sequence P1, Po, > > - of integers such that, for each k = 1, 2, 3,- -, 
l 
(6) (xm — xnl <5; (m,n = P,). 
Then 
1 
(7) xP, = xp,| < Jk (k = ], 2, 3, > > *). 


It follows from the comparison test for convergence of series that the series 
(8) xp, + (xp, — xp) + (xp, — xp.) + (we, — xp) + 


is convergent, say to L. Since the sum of n terms of this series is xp,, it follows 
that 


(9) lim xp, = L. 


n- 0 


This shows that a subsequence of the given sequence converges to L. To prove 
that the whole sequence converges to L, let e > 0. Choose a positive number NV 
such that 


(10) lxn — Xm| < €/2 (m,n = N) 
and 
(11) |era — L| < €/2 (n 2 N). 


Then, when n > N, we can use the above inequalities and the fact that P, 2 n 
to obtain 


(12) |xn — L| = |(xn — xP.) + (xp, — L)| 
S |xn — xp,| + |xr, — L| < €/2 + €/2 


i 
M 


and our result is established. 


12.2 Ratio test and integral test This section contains more 
theorems about convergence of series. These theorems, like hammers 
and saws and other tools in carpenter shops, have their usefulnesses and 
we can cultivate abilities to make effective use of appropriate ones at 
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appropriate times. We begin by fumbling with the question whether 
the series 


(12.21) 12x 4 22x2 + 32x3 4 42x4 4 52y5 1 ee 


converges when x = 0.99. We set un = n?x” and obtain the first and 
then the second of the formulas 


un = n?(1 — 14)”, uo = 10,000(1 — 140). 


Since (1 — 1/n)” — 1/e as n—> œ, it is easy to reach the correct con- 
clusion that x10 is of the order of magnitude of 10,000/e and that there 
are several values of n for which uz > 1000. This can make us suspect 
that the series is not convergent, but it is still possible that the series may 
converge to some relatively large number of the order of 10° or 10?2, 
Appreciation of usefulness of the ratio test can now be gained by noticing 
that the simple calculation 


Unti Jim | (n + 1x 
Un | n— 0 | nx 


1\2 
lim (1 + 1) |x| = |x] 
n— o n 
shows that the series (12.21) converges when |x| < 1 and hence when 


x = 0.99. 


Theorem 12.23 (ratio test) Let ui + us + uz + -> + be a series 
of nonzero terms and suppose that 


(12.22) lim 


(12.231) lim 


In case p <1, the series Zun is absolutely convergent and lim un = 0. 
n> æ 


In case p > 1, the series diverges and lim |u,| = œ. 
n— œ 


To prove the first part of the theorem, choose a number r for which 
p<r<1. There is then an index N such that 


Unt 


Un 


(12.232) <r (n 2 N). 


Giving n successively the values N, N+ 1, N+2,--- yields the 
formulas 


Juny] < |un|z 

lunse| < |ungilr < |uxnlr? 
Juws] < [unselr < |uxlr? 
lunya] < lungalr < lunlr’ 
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etcetera. Thus the series 


un + UN+1 + ung + UN+3 + e. 


is dominated by the convergent series 


lun| + lun|r + luxļr? + fuylr? + > 


and the conclusion follows. To prove the second part of the theorem, 
choose a number R for which1 < R < p. There is then an index N such 
that |un41/un| > R when n = N and hence 


(12.233) |un+n| > |un|R? (p = 0,1,2,--°). 


Since uy # 0 and R > 1, this shows that |un| > «© as n— œ and com- 
pletes the proof of Theorem 12.23. 

The remainder of the text of this section involves a connection between 
series and integrals which is both interesting and important. Every- 
thing that we do can be easily understood and permanently remembered 


y=f(x) 
Ty 


KK HY Tz 
YY 1 ae yy T3 
Aen a, Aela 


—_ 


i i ug i , ' 
O 1 2 3 n-1 n n+1 x 
Figure 12.24 


with the aid of Figure 12.24. We suppose that the terms of a series Zu; 
and the values of a function f are related by the formula 


Uk = f(k) (k = 1,2,3, n ), 


that f is positive and continuous and decreasing over the interval x 2 1, 
and that f(x) — 0 as x— œ. The left member of the formula 


(12.241) Ð u= M flx) dx + |Til + IT] + {Ts 
k=l 
+- ITa] + Un 


is then the sum of the areas of the rectangles of heights u1, ws, °° * 4 Un 
that stand upon the unit intervals with left (or left-hand) end points 
at 1,2,3,- „n. The first term of the right member is the area of the 
region bounded by the graphs of the equations x = 1, x = n, y = 0, and 
y = f(x). For each k, |7;| is the area of the triangular patch Tę bounded 
by the graphs of x = k, x = k + 1, y = f(x), and y = ux. Elementary 
bookkeeping shows that the members of (12.241) are equal. The easiest 
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way to appraise the sum of the areas of the triangular patches is to put 
duplicates of these patches in the rectangle having opposite vertices at 
the origin and the point (1,u,). Setting 


An = [Til + ITe| + |o. + (Tail, 
we see that 0 < 4, < 43 < °> < An < uy. There is therefore a 
number C such that 
0 < lim [Tl + ITol + . . 8 + [Tail] = C < Ul. 


Putting Cn = 4, + un gives the following theorem. 

Theorem 12.25 (integral test) If f is positive and continuous and 
decreasing over the interval x = 1, if f(x) — O as x— œ, and if ur = f(k) 
for each k = 1,2, 3, > - - , then the sequence Ci, C2, C3, + * ` of constants 
defined by 


n 


Yu = in flx) dx + Cn 
k=l 


is convergent and O S Cn S u, and there is a constant C for which 


O< lim Cn = C S u. 
This theorem clearly implies the following theorem, which is known 
as the integral test for convergence of series. 
Theorem 12.251 (integral test) If f is positive and continuous and 
decreasing over the interral x Z 1, if f(x) — 0 as x— œ, and if ur = f(k) 


for each k = 1, 2,3, > > + , then Buz < œ if and only if | f(x) dx < oœ. 
It can be shown that Ci 2 C: Z Cz 2 -+ , and this result is some- 
times useful. The most important application of Theorem 12.25 involves 


the case in which f(x) = 1/x, Du, is the harmonic series, and the constants 
C, and C are called y, and y (gamma). This application gives 


(12.26) `, : = log n + Yn; 
k=l 
where the constant y for which 
(12.261) y = lim y, = 0.57721 56649 01532 86061 


is called the Euler constant. This constant y is, after r and e, the most 
important mathematical constant not appearing in elementary arithmetic. 
Putting f(x) = 1/x*, where s > 1, gives 


71 l 1 
(12.271) =- E ~ =| + C,(s), 


ysl 
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where 0 < C,(s) < 1. Letting n— œ and using the definition 
= | 
(12.272) = Sz 


of the Riemann zeta function {(s) gives the nontrivial formula 


—~@i1. l 
(12.273) f(s) = È B= soto: 


where 0 < C(s) < 1. The above results and the results obtained in the 
problems at the end of this section imply that 


(12.281) ) 5< o (s > 1) 
k=] 
(12.282) Ypre 6 < 1) 
k=1 
= 1 
(12.283) » Flog EF < 0 (s > 1) 
< 1 
(12.284) 2, Hog Ey = 0 (s Ss 1). 


These series are often used with the comparison test to determine whether 
other given series are convergent. 


Problems 12.29 


1 Use the ratio test to show that the series in 


2 3 4 
a) t=1l+e+5t+qtqt oo: 
x? x4 xê 
2+ 41 6 


3 5 y1 
(c) sinx =x tH - at 


(b) cos x = 1 — 


converge for each x, and that the geometric series in 
1 2 3 
@) po, 5 ltatet tsss 


converges when |x| < 1 and diverges when |x| > 1. 
2 Use the ratio test to show that the series 
Lle fDi? + Ble t diet foo 


diverges for each x for which x + 0. 
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3 Prove that if s is a constant, then the series 
sx + 28x2 + 3843 + 48x44 + eee 


converges when |x| < 1 and diverges when |x| > 1. 
4 When x = 1, the ratio test does not tell whether the series of the preceding 


problem is convergent. Using this hint, give an example of a divergent series 
Zt, for which 


(1) lim AH = 1, 


no Un 


and then give an example of a convergent series for which (1) holds. 
§ The nth term x, of the series 


2! ! 6! 
at apt Gye 


is (2n)!/(n!)?. Prove that 


ntl — lim n+ 2)Qn +1) _ 4 


lim 


no Un n— © (n + 1)(n + 1) 


and hence that the series is divergent. 
6 What information does the ratio test give about convergence of the series 


We (2)? 31)2 4!)2 
r+ QD 24 OP ae 


x3 + 


xtp ee? 


Ans.: The series converges when |x| < 4 and diverges when |x| > 4. The ratio 
test gives no information when x is 4 or —4. 


7 Supposing that m =Æ a, show that the series in 


1 1 x—a (x—a)?  (x-— a) 


me ieee ——— u > O O 


is a geometric series and that it converges to 1/(x — m) when |x — a| < |m — al. 
8 Supposing again that m = a, show how the calculation 


to. tt tt 
x—m m—x (m—a) — (x —a) m—-a,  x-a 
Mm —a 


can be used to obtain the formula of the preceding problem when |x — al < 
|m — al. Hint: We must always know that the geometric series 


lrer +rep... 


converges to 1/(1 — r) when |r| < 1, and we must sometimes be wise enough to 


start with 1/(1 — r) and write the geometric series that converges to it when 
Irl <1. 


12.2 Ratio test and integral test 605 


9 Write a complete proof of the fact that the formula 


x =t 1 _ x 
x2 — 9 9 +2 a4 


is valid when |x| > 3. Obtain a similar expansion of x/(x? + 1). 

10 For each n = 1, 2, 3, : - + let d(m) be the number of positive integer 
divisors of n, including 1 and n, so that d(1) = 1, d(2) = 2, d(3) = 2, d(+) = 
d(5) = 2, d(6) = 4, etcetera. Tell why the ratio test does not provide a useful 
source of information about convergence of the series 


d(1)x + d(2)x? + d(3)x3 + d(4)xt +--+: 


Tell why d(n)2 l and the series diverges when x 2 1. Tell why 1 Sd(m) Sn 
and the series converges when 0 Sx < 1. Hint: The series x + 2x? + 3x3 + 
4x4 + + + + is convergent when |x| < 1. 

11 Give two or more examples of convergent series vı + uz + uz + °° of 


positive terms for which lim Un+1/un does not exist. Ans.: The series 
n> © 


atetaetatetetretiet: : 
1 
— + — — — — — 
2 10 2? 10 235 108 24 10 


are simple examples. 
12 Show that the series in 


(1) ee eee 


e? — 


is a geometric series that converges to the left member when x > 0. Use this 
result to show that, when x > 0, 


(2) 7 = lim y xe ke, 


n>% Ty 


Show that if the manipulations 
> ? a; . kt dye = ` —kz 
(3) Í z ae Ee = Í lim 2 xe™kz dx = lim in » xe = dx 


are valid, then 
o x% a 1 
©) h zie Le 


Remark: We shall soon start hearing that the last series converges to 47/6. 
13 Supposing that n is a positive integer, sketch a graph of 


— 
n? + x? 


(1) y= 
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and show that 


n—-l 


1 ni i. z _ = 
@) Lape? h preta ih" t 
Use this result to show that 
o n—l 
1 
(3) È pA n2 + k? = ©. 


Remark: This result is of interest in cosmology. Suppose a particular universe 
contains an earth at the origin of a plane x, y coordinate system and contains a 
star like our sun at each point (x,y) for which x and y are integers not both zero. 
The rate (in appropriate units) at which the earth receives radiated energy from 
the star at the point (n,k) is then 
1 
ETA 


provided the inverse square law is applicable These hypotheses and (3) imply 
that the earth would receive energy at an infinite rate and hence would burn up 
instantly. This result implies that either the stars cannot be so uniformly dis- 
tributed or that (perhaps because other stars and interstellar material interfere 
with transmission of energy) the inverse square law is inapplicable. 

14 Supposing that 0 < s < 1, show that 


A 1 nanm-li 
ts + Cx(5), 
» k l — s 
where 0 < C,(s) S$ 1. Show that 
~ 1 
= = 2 — 2 Chn, 
» Ti Vn -2+ 


where O SC, < 1. Check the last result when n = 1 and when a = 4. 
15 Prove that, when n 2 2, 


$ 1 | D 1 
i, klogk pa (k + 1) log (k + 1) 

= log log n — log log 2 + Cn, 
where 0 < Cr < 1/(2 log 2). 


16 Prove that, when s > 1 and n 22, 


` 1 1 1 1 
2, Roe 7 =i tee 5 raa] + OO. 


where 0 S$ C,(s) < 1/2(log 2)", and that 


1 1 1 
» Blog ky ~ 5 — 1 log 2 T C(s), 


where 0 < C(s) S 1/2(log 2):. 
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17 With the aid of an appropriate figure show that, when s > 3, 


tol.. l 2 ] 1 1.1 
zt at +s — dx = sz 


2 x’ s — ] 2871 3 


and hence that 


l 1 
ye <SG) -1l<aS 


and 
lim ¢(s) = 1. 
&—> 0 


18 Sketch the graph of f(x) = x~?! over the interval 0 < x < 1 and observe 
that, even though f is unbounded and therefore not Riemann integrable, it can 
be suspected that 


n 
. k\~% 1 1 
(1) lim » (<) z = f x dx = 2, 


N> © 


where the integral is a Cauchy extension of a Riemann integral. In any case, 
use a result of Problem 14 to show that formula (1) is correct. Remark: One 
who undertakes to prove (1) without use of Problem 14 does so at his own peril. 

19 Suppose that f is nonnegative and continuous and increasing over the 
interval x 2 1 and that f(x) > © as x— œ. Let 


ur = f(k) (k =1,2, -+ 0). 


With the aid of a figure which is in some respects like Figure 12.24, show that to 
each n there corresponds a number 4, such that O < 4, < 1 and 


uit usp- + tn = f, fle) da + tn — n(n — 1). 


Applying this to the case in which f(x) = log x and 


n 


n 
» Uk = `, log k = log a!, 
k=1 k=l 
obtain the formula 


log n! = n logn — n + 1 + (1 — 4,) log n, 
and hence 


n! = neren An = (n/e)"en!~ Ax, 


Remark: This elementary calculation gives an introduction to the important idea 
that n! is of the order of magnitude of (n/e)”. It is easy to see with the aid of 
a figure that 4, is a little greater than $ and hence that n!—4= is less than Vn. 
In Problem 4 of Section 12.6, we shall discover (among other things) that if n 
is a positive integer, then 


n! = V 2nr nereti, 


where @, is a number for which 


1 1 1 
1 — zoni < 9n <1 — Om T 0m 
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and hence @, is quite close to 1 even when n = 1. The above formula for n! 
is a Stirling formula, and it is very useful. 

20 Weare familiar with the fact that 


o Í . h Dor xi-Pp A 1 
a) J, x? x = lim 1 x? ds = lim A | = 


when b > 1 and 


(2) . hj] . h 
(2) f = dx = lim = dx = lim log «| = 00, 
i . 


x hoo J1 XY h= œ 


Some questions and answers involving existence (or convergence) of Riemann- 
Cauchy integrals are quite analogous to questions and answers involving infinite 
series. 


(3) Theorem Let f and g be Riemann integrable over each finite interval 
a Sx S hforwhichh > a and let 


(4) 0 < f(x) S g(x) (x = a). 
Then 
(5) Os fo se) dx sf” a(x) ax 


when (5) is interpreted to mean that Í j f(x) dx S M whenever J, j g(x) exists and 
a 


CO. 


is M, and that N g(x) dx = œ if l f(x) dx 


Proof of this theorem depends upon the fact that 
h h 
(6) Os f Ia) dx s f el) ax (h = a). 


The functions in (6) are monotone increasing functions of h. In case JA g(x) dx 
a 
h 
= M, the function j) g(x) dx has the least upper bound M. The function 
a 


` f(x) dx then has an upper bound M and hence must have a least upper 
bound M, for which M, < M and 


(7) im fle) de = lim T fe) de = Lub. T fx) dx = M, < [ ° a(x) dx. 


In case T f(x) dx = œ, the function MEG dx does not have an upper bound, 
a 


h co 
SO Í g(x) dx cannot have an upper bound and hence Í g(x) dx = æ. This 
a 


proves the theorem. Supposing that p > 1 and that g is real, prove that 


I, Cog x)" dx < œ 


x? 
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by choosing a number r for which 1 <7 < p and showing that there is a con- 
stant A for which 


A 


LPT = xe 


(log x)? = (log x)? 1 1 


when x 2 2. Prove that 


æ ] 
J O8 X dx = œ. 
3 


x 


21 Using results of the preceding problem when and if they are helpful, prove 
that the first of the integrals 


1 xP o xP © x? 
lam ff, tin h cea 


exists when p > —1 and fails to exist when p £ —1. Prove that the second 
integral exists when q — p > 1 and fails to exist when g — p < 1. For what 
pairs of values of p and g does the third integral exist? 

22 The first of the two integrals 


(1) [tea [relax 


is sometimes said to converge absolutely if the second one exists. Prove the fol- 
lowing theorem. 


(2) Theorem If f 1s Riemann integrable over each finite intervalaSx Sh 
for which h > a and if A | f(x)| dx < 0, then [Pr dx exists. 
a 


Solution: This theorem and its proof are very similar to Theorem 12.17 and 
its proof. Let 


p(x) = sllf@)|+f@)), a) = allf@)| O, 
so that 0 < p(x) < |f(x)| and 0 < g(x) < |f(%)|. It then follows from the 


theorem of Problem 20 that the limits in 


h h 
lim Í, p(x) dx = L, lim I q(x) dx = Le 
ho © h= œ 


exist. Hence 


im fy" f) de= lim [ fy 9G) de fy a0) de | = La = Le 


23 Does existence of Í a [f(x)| dx imply existence of h f(x)dx? Ans.: 
No. For example, f(x) might be e~* when x is rational and —e~* when x is 
irrational. In this case |f(x)| = 27 and I |f(x)| dx = 1 but f is everywhere 


discontinuous and there is no interval over which f is Riemann integrable. 
24 Prove that if f and g are both Riemann integrable over each finite interval 
a Sx Sh for which 4 > a, and if 


FOL s ll (x & a) 
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then existence of Í j lg(x)| dx implies existence of ia f(x) dx. Solution: The 
a 


theorem of Problem 20 implies that J ° [f(x)| dx exists and the theorem of 
a 


Problem 22 then gives the required result. 
25 Prove that if f is integrable over each finite interval and the first of the 
integrals 


I | f(x)| dx, h f(x) sin x dx, A f(x) cos x dx 


exists, then the other two also exist. Solution: Since sin x and cos x are con- 


tinuous, f(x) sin x and f(x) cos x are integrable over each finite interval. Since 
also 


If) sin x] S [f@)[, |F) cos x] S If), 


the results follow from the results of the preceding problem. 


12.3 Alternating series and Fourier series The following theorem 
embodies the alternating series test for convergence of series. 

Theorem 12.31 If the terms of a series Du, are alternately positive and 
negative and if their absolute values decrease and have the limit Q so that 


lui] > fu > [u| > >+, lim [u| = 0, 
then the series converges to a number s for which 
(12.311) |s — $ un] < fons (n = 1,2,3, + +). 
k=1 


The inequality (12.311) tells us that if we use a particular partial sum 
as an approximation to s, then the error will be less than the absolute 
value of the first term of the series not included in the partial sum. This 


information is very useful. To prove the theorem, we suppose that the 
given series has the form 


(12.312) ay — az + az — a +H as — ast te, 


where a, > az > ag> > -' and ap —> Q as k— œ. To locate the par- 
tial sums s1, 52, © © © shown in Figure 12.313, we start at the origin, go 
to the right the distance a; to reach sı, then go left the smaller distance 
az to reach sz, then go right the still smaller distance az to reach 53, and 


Figure 12.313 


O $9 84 Sg °°° *** S7 Ss S3 Sy =a, 
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soon. Because the quantities in parentheses are positive, the formulas 


(12.314) So, = (ay ~ a2) + (a3 — a4) tree fe (akı — azk) 
(k = 1,2, -o *) 
show that O < s2 < 54 < xe < - -and the formulas 
(12.315) sSox-1 = @1 — (az — as) — (a, — as) — > + * — (an2 — axı) 
(k =1,2,:>»>) 


show that a4, = s1 > s3 > s5 > s> >>>. When n > k, these facts 
and the formula 


(12.316) Sant) = Son F Angi > Son 
imply that 
(12.317) 0 < Sak < Sen < Sont1 < S2k—1 < ay 


as Figure 12.313 indicates. The bounded increasing sequence sa, 5a, 
sg, °° © and the bounded decreasing sequence 51, 53, 55, © © + must have 
limits and these limits must be equal, since Song: = Son + Geng1 and 
dont. — Oasn— œ. Letting s be the value of these limits, we have 


(12.318) lim sp = S. 


n> o 
In case n is even, we have 5, < $ < 5,4; and hence 
0 < 5 — Sn < Sagi — Sn = (anil. 
This formula and a similar one holding when n is odd give the conclusion 
of Theorem 12.31. 

The remainder of the text of this section gives a preview of fundamental 
ideas about series that are called Fourier (1768-1830) sertes. While 
snatches of the story can be understood by everyone, most of the results 
are given without proof and it is necessary to study pure and applied 


mathematics for a few years to obtain a full appreciation of the whole 
story. Let L be a given positive number, and let functions ¢1, x 


o3, © © -© be defined by 
(12.32) h(x) = NE sinta (k = 1,2,3, +> -). 


Let E be the closed interval 0 < x < L. Because the little trick enables 


us to obtain formulas that have many other applications, we write Í M 


; L 
instead of h . It can then be shown that 


(12.33) falo de=, fp oola) dx =0 (j k). 


On this account, we say that the functions ¢1, ¢2, $3, °° ° constitute 
an orthonormal set over E. Now let f be a function which is defined over 
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E and is such that the two integrals 


(12.34) fpf) de, f, [fale de 


both exist as Riemann integrals or as Cauchy extensions of Riemann 
integrals. Supposing that n is a positive integer, we seek constants 
C1, C2, © © * , Cn for which the integral in the left member of the formula 


| ` 2 _ x)|? dx — . 2 
(12.35) fa fla) — È bela), de = fy lOl de — X la 


+) lee a? 
k=l 
will attain the least possible value. The first big step in the theory is 
made by working out the formula (12.35) in which the constants ay, ao, 
- are the Fourier coefficients of f defined by the formulasf 


(12.36) ar = |, fle) du(x) dx (k = 1,2, +>). 
The series 
(12.361) a1$1(x) + azġ(x) + a3zh3(x), nr 


in which the coefficients are defined by (12.36), is called the Fourier series 
of f. It is very easy to see that the c’s for which the right side (and 
hence also the left side) of (12.35) is a minimum are those for which 
Ck = ak. Putting c = a, in (12.35) gives the key formula 


n 12 > n > 
(12.362) fa | F(x) -» apela) | dx = fa I f(x)? dx — | Jar|2. 


k= 


Since the left side of (12.35) cannot be negative, we obtain the first and 
then the second of the inequalities 


(12.363) X las faloa X la s fa SO dx. 


The second inequality is called the Bessel (1784—1846) inequality. For 
most purposes, the important orthonormal sets ¢:, ¢2, © © © are those 
for which the members of (12.362) converge to 0 as n —> œ so that 


(12.368) lim fe | f(x) — D anbe(x) P dx = 0, 


X lal? = fy IAO ae 
k=] 

t These formulas were known by Euler. Fourier contributed very little to the theory 
of Fourier coefficients and Fourier series. The things bear his name, not because he 


invented them, but because he advertised their formal usefulness in problems of mathe- 
matical physics. 
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Such sets are said to be complete. While proofs of such things are so 
long and devious that nobody should expect to be able to originate them 
in a few days, it can be proved that the set in (12.32) is complete. The 
set 


(12. 365) z af? cos 7 n af cos 2 cos —— nf? cos = e. 


is also a complete orthonormal set over E when Z is the interval0 < x < 
L. The set 


(12.366) FE VEL Fj sing x, A os E x oa sin Ty, e. 
is a complete orthonormal set over E when E is an interval of length 2L. 
The world’s mathematical storehouse contains many other useful com- 
plete orthonormal sets, and the above formulas have many important 
applications. Henceforth we suppose that ¢1, $2, $3, © © - is the trigo- 
nometric orthonormal set appearing in (12.32) or (12.365) or (12.366) 
and that f has period 2L so that f(x + 2L) = f(x) for each x. Even in 
this case, fundamental problems involving validity of the formula 


(12.37) F(x) = aoi(x) + aepo(x) + aspel) + 


remain unsolved. It is, however, known that (12.37) is valid over 
—æ <x < © provided (i) f has period 2L, (ii) f is bounded and piece- 
wise monotone over —L <S x S L, (iii) 


(12.371) lim fe th) + fe =) 


h-0 


= f(x), 


(iv) f is odd so that f(—x) = —f(x) in case the orthonormal set is (12.32), 
and (v) f is even so that f(—x) = f(x) in case the orthonormal set is 
(12.365). 

The most illuminating batch of applications of the above ideas involves 
the Bernoulli functions Bo(x), Bi(x), Bo(x), - - + that appeared in Sec- 
tion 4.3, Problem 10, and in Section 5.3, Problem 19. These are the 
functions of period 1 for which Bo(x) = 1, 


(12.381) BY (x) = Baala) (n = 1,2,3, °° >) 
(12.382) I " B(x) dx = 0 (n = 1,2,3,- >) 


except that (12.381) fails to hold when n is 1 or 2 and x is an integer. 
In particular, Bı(x) is the saw-tooth function for which B(x) = 0 when 
x is an integer and 


(12.383) By(x) = x — [x] — $ 
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when x is not an integer and [x] denotes the greatest integer less than or 
equal to x. Problem 8 at the end of this section will show where the 


first of the formulas 


(12.384) 


—?2 | sin 2rx sin 4rx . sin 6rx , sin rx 
Bia) = 52 | Se t-z t- tog tie 


2 | cos2rx . cos 4rx , cos 6rx , cos 8rx 


2 [sin2rx . sin rx , sin 6rx | sin rx 


eS A es A ees A Ss | 


—2 [cos2rx , cos4rx , cos6mx _ cos 8rx 
B(x) = l 14 + J4 + 34 + 44 + ee 
—2 [| sin 2rx , sin4rx , sin6xx , sin 8rx 
Bs) = Ge Ga p t-g tog top ts | 
comes from. The remaining formulas come by successive integration; 
it can be proved that the series for Bo(x), Bs(x), Ba(x), © - © can be dif- 


ferentiated and integrated termwise [except that the series for B(x) is 
not termwise differentiable when x is an integer] and hence that the 
fundamental formulas (12.381) and (12.382) hold. Since Section 4.3, 
Problem 10, shows that 


l —l 
(12.385) B,(0) = n = 77 B,(0) = os = 790)’ 


putting x = Q in the formulas for B(x) and B(x) gives 
~ 1 T? < l a4 
(12.386) ¢(2) = » B= ¢(4) = È a= 


Balth Van der Pol used to claim that persons who know these formulas 
are mathematicians and persons who do not are not. 


Problems 12.39 


I Use Theorem 12.31 to show that each of the following series is convergent. 


a) 1-3+3-ł+%-%+:> 


i i i 
©) jog3 ~ ig4 t igs gét 
i 1 i 1 
(e) 108 log 20 log log 21 t fog log 22 log log 23 rt 


1 1 1 1 
© rfe- Tatra r atra- 
log 3 log4  log5 log6 1 
e) 2 _ OE ogi og plL... 
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2 Find, correct to four decimal places, the numbers to which the following 
series are convergent. 


1 2 3 4 5 

(2) 7 fo t 108 ~ Tot ET 
12? 22 32 42 52 

© 0- w t 108 ~ Tort Toe 


3 Show that the series 
xo x? x? xi 
gro tay tot 


for which the nth term is x" /(2n + 1), converges when —1 < x < land diverges 
when x < —l and when x 21. Haint: Some but not all of the information is 
revealed by the ratio test. 


4 With the aid of basic information about alternating series show that the 
series in 


1 l 1 1 l 
o S=proptpo Rt Ro 
converges to a number S for which 0 < S <1. Then show that, correct to 5D 
(5 decimal places), 

S > 0.75000 S < 0.86111 S > 079861 S < 0.83861 
Remark: One who wishes to invest a moment to pich up some ideas may start 
with the esoteric but important formula 

m? 1 1 1 l 1 1 
2 6 pratpratretet 
and obtain the formula 
r? 1 1 1 
(3) mg ~OtmtOt+EetOrar :- 
Subtracting twice (3) from (2) then gives (1) with S = 1?/12. Subtracting 
(3) from (2) gives the formula 
m? 1 1 1 1 1 
6) Ee 7 ptRt at Att 
which sparkles almost as brightly as (2). 
5 Supposing that 0 < x < 1, use the formula 


x4 


2 3 
log 0 +4)=2-5+3-7t 


to show that 
1 x x 8 
x-lg + =08(5-34+9-F4+--:) 


and 
0 < x — log (1 + x) < $x? 
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6 Supposing that x = 1/4, give one or more reasons why the series 


sinx sin 2x sin 3x sin 4x 


ney air er <n 


is not an alternating series to which Theorem 12.31 applies. 

7 The function ¢, being defined by (12.32), prove the formulas (12.33). 

8 It is not expected that the theory of Fourier series and its formulas have 
been learned, but it is expected that we can start solving problems when suitable 
formulas and instructions are given. Write the formulas (12.32) for the case in 


which L = 1 and show that the formula (12.36) for the Fourier coefficients 
becomes 


a, = V2 f f(x) sin krx dx. 


Letting f(x) be the Bernoulli function Bi(x) so that f(x) = x — ġwhen0 < x <1, 
show that 


1 
a, = vi |, (x — $) sin krx dx. 


Show that integration by parts gives 


— 1 1 
âk = a/2 | (« — z) = cos kra | + 4 Ef cos krx dx 


— V21 +coskr —V21+(—1) 
kr 2 ~ kr 2” 
so that a, = O when K is odd and ap = — ~/2/km when k is even. Observe 


that the conditions in the sentence following (12.37) are satisfied and hence that 


(12.37) must be valid. Then show that substituting in (12.37) gives the first 
of the formulas (12.384). 


9 Another particularly important example involves the square sine function 
(or square wave function) defined by 


Sin x = sgn sin x. 


The graph of this function is shown in Figure 12.391. To find the trigonometric 


Figure 12.391 


Fourier series of the odd function Sin (#x/L) which has period 2L, use the ortho- 
normal set (12.32) and, after observing that Sin (rx/L) = 1 when 0 < x < L, 
calculate the Fourier coefficients of Sin (mx/L) from the formula 


L al . krx 
a= f V2 sin 7 dx. 
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Then tell why (12.37) must be valid and substitute in it to obtain 
. we tf. we 1. Bre 1.5 1. 7 
Sin | = [siny + 3 sin -F + gsin y + 3 sin > + . |. 
Observe that this implies that 


wm. me l. Bre 1 , 5rx 1 . Trx 
g~ snp t3 yg tzin g tsinoy tiee 


when 0 < x < L and that putting x = L/2 gives the formula 


jaa 


=l-gts-7+E--- 


AIN 


which we have seen before. 


10 Engineers who are interested in fully rectified (or full-wave rectified) 
alternating currents would want x replaced by wt in the formula 


si = = [5 S _ S08 Ee _ 208 be cos 8x | 
mS L213 R57 GT 
but this shift is easily made. Work out the formula and observe that it is correct 
when x = 0 because 

1 1 1 
rates t37 t 7g 


11 Sketch a graph of the even function f of period 2r for which f(x) = x when 
Osx <7. Show that 


m 4 cos 3x cos5x . cos 7x 


12 Use the formulas following (12.384) to show that, when n is a positive 
integer, 


B,,(0) = (—1)*#1 any (2n) 


and hence that Ba(0) is a small number like 1/6% when n is large. Use the 
formula B, = k! B,(0) and the Stirling formula of Section 12.2, Problem 19, to 
show that 


—. 2n 
Bon = (1) 4 Var (Z) ¢(an)ebeite, 


Remark: Even crude estimates show that |Bon| is very large when n is large. 
Since me < 9, we have n/re > 10 and |Ban| > 10% when n 290. Similarly, 
|Ban| > 1000?” when n = 900. 

13 Our work with Fourier series has involved Fourier analysis. We started 
with given functions and found their Fourier series. While proofs of results lie 
beyond the scope of this course, we take brief cognizance of a problem in Fourier 
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synthesis. Let 1, 2, $3, © * > be a set of functions orthonormal over a set E. 
Let ai, a2, a3, °° > be given coefficients for which Zla,|2 < ©. Then, when 
Lebesgue integrals are used, there is a function f for which the formulas 


DO faae tim fyl 16) — Y abt) fae = 0 


are valid. Moreover the numbers a), az a3, * * * are the Fourier coefficients 
of f, that is, 


(2) a, = f fle) Bile) ax (k = 1,2,3, +»). 


In this formula we have recognized the fact that, in many important applications, 
p(x) is a complex number and “‘¢;(x) bar” is the complex conjugate of ¢,(x). 
It has not been asserted (and is in fact sometimes untrue) that the series in 


(3) f(x) ~ aig; (x) + alx) + ashs(x) + °° > 


converges to f(x). However, as the second formula in (1) shows, the sum of the 
first n terms of the series must be a good global approximation to f(x) whenever 
nis large. Persons who study Lebesgue integration and Fourier series for a year 
or two can learn all about these things. 


14 Supposing that o, 41, 42, © ©- > and By, Bs, - + © are bounded sequences 
of constants, let 


(1) fe) = Ao + > (Arcos kx + By sin kx) 
kal 


for those values of x (if any) for which the series is convergent. The series in 
the right member of (1) is called a trigonometric series. Some profound studies 
of the series in (1) depend upon use of the function F defined by 


= Ap cos kx + B, sin kx 
2 F(x) = bAa — Y ESOS eY T Ok SID ey, 
(2) (x) = gAqx » rr 


the series being convergent for each x because it is dominated by a convergent 
series of constants. Show that, when ż Æ 0, 


F(x — 2s) — 2F(x) + F(x + 22) 
3) ——— p 


= Áo + È [4 cos kx + By, sin kx] (3 =. 


Remark: Riemann discovered this formula and used it to solve some difficult 
problems. In more advanced mathematics, it is proved that the formula 


(4) lim a | uo + È uk (= =" | = uo + È Uk 
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is valid whenever the series on the right is convergent. From this it follows that 


6) tim ED — FG) + Fee A 
£? 


i0 


= f(x) 


for each x for which the series in (1) is convergent. We conclude with a brief 
outline of the sophisticated steps by which this sophisticated result is used to 
prove the following difficult theorem. If the series in (1) converges to O for each 
x, then Ay, = By = 0 for each k. The present hypotheses imply that F is con- 
tinuous and that the left member of (5), the generalized second derivative of F 
at x, is 0 for each x. These facts can be used to prove that F must be a linear 
function, and further arguments involving (2) can be used to prove that F(x) = 0 
for each x. Since F(0) = F(2r) = 0, use of (2) shows that 4p = 0. Further 
arguments involving (2) show that 4, = B, = 0 for each k. The theorem is 
called a uniqueness theorem because it implies that if f is a given function, then 
there can be at most one collection of constants 49, 41, 4, ° °° and Bi, Ba +> 
for which the series in (1) converges to f(x) for each x. This means that if the 
formulas (1) and 


(6) f(x) = Co + 2 (Cr cos kx + D, sin kx) 
=] 


are both valid for each x, then C, = 4, and D, = B, for each k. Our brief 
glimpse of the Riemann theory of trigonometric series can make us aware of the 
fact that the uniqueness theorem for trigonometric series has been proved, and 
the proof involves mathematical ideas that we have not yet assimilated. This 
matter is important, because trigonometric series appear even in quite elementary 
applied mathematics and we need some authoritative information to help us 
appraise the revelations appearing in textbooks that give superficial treatments 
of the subject. 


12.4 Power series The series 


(12.41) coterie — a) + cxx — a) + cel — a} tee, 


in which a and co, ¢1, c2, * * * are constants, is called a power series in 
(x — a). The fundamental reason for importance of these things lies 
in the fact that powers of (x — a) are relatively easy to calculate, to 
differentiate, and to integrate. Some power series, like the series 


(12.411) Of + 1%@%—a)4+2!(% — a)? + 3% — ait: 


converge only when x = a. Others, like those in the important formulas 
`N 


2 3 4 
(12.412) e=l+a+itatat 
2 4 6 
(12.413) coss =1- Ati at 


x3 x5 x 


(12.414) sins =x — ate 7 


tees, 
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converge for each x. The geometric series in the formula 


(12.415) po ltetetttte.. ({x| < 1) 


x 


is an example of a power series in (x — 0) which converges for at least 
one x different from 0 and diverges for at least one x. With each power 
series in (x — a) which converges for at least one x different from a 
and diverges for at least one x, there is associated a positive number R, 
called the radius of convergence of the power series, such that the series 
converges absolutely for each x for which |x — a| < R and diverges for 
each x for which |x — al > R. As Figure 12.42 indicates, the interval 
|x — al < R is called the interval of convergence of the series. 


Interval of convergence 
a-R a ath æ 
Figure 12.42 


We now give, without proof, some very useful information about power 
series. Those who are interested in proofs should study the theory of 
functions of a complex variable. If the power series in (12.43) converges 


when |x — a| <r and if for each such x we let f(x) be the number to 
which the series converges, then 


(12.43) f(x) = co + ci(x — a) + ca(x — a)? + cels — a} t- 


The function f thus defined is continuous over the interval |x — a| < r 
and, moreover, has derivatives of all positive orders which ‘‘can be 
obtained by termwise differentiation,” that is, when |x — al < r, 


(12.441) f'(x) = cı + 2ce(x — a) + 3¢3(x — a)? + 4e4(x — a) + À 
(12.442) f(x) = 12c: + 2°3¢3(x — a) + 3-4e4(x — a)? 


+ 4:5¢5(% — a)? + > 
(12.443) f(x) = 1-2-3cg + 2-3-4c4(% — a) + 3-4-5e5(% — a)? 


+ 4-5-6ce6(x — a)? + > 
(12.444) f(x”) = 1-2-3-4c4 + 2°3-4:S¢5(% — a) 

+ 3-4-5-6eg(x — a)? + °°: 
and so on, so that for each n = 1, 2,3, +> 


(12.445) f(x) = n!en + a+ Cnil — a) 


(n + 2)! 
+7] 


Capol — a) H o 


Termwise integration as well as termwise differentiation is permissible, 
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[raat f at-oas | c(t — a)? dt 


+f cst — a)idt+.-.. 


ea 4! 


that is, 


(12.45) | EE 


cola = a) +a FSO He 


(x — a) 


He pe 


when |x — a| <r. Moreover, we can multiply (12.43) by g(x) and, 
provided g is Riemann integrable over the interval from a to x, integrate 
termwise to obtain 


(1246) [ZIOA dt = fE cet) de + [7 exlt agh) dt 
+ [colt — a)?g(i) dt + more, 


Putting x = a in (12.43) and the formulas that follow it gives the 
remarkable formulas 
(12.47) fla) =o, f'(a) =a, f(a) = 2!eo, f(a) = 3!c3, 
f(a) = 4lcq, 


Solving these equations for co, c1, co, ‘ * * and putting the results in 
(12.43) gives the more remarkable formula 


(12.48) fla) = fla) + we - a) + ( a) 


3 
+O) (x — a)? + oe 

These formulas show one of the ways in which the coefficients co, ¢1, °° > 
in a convergent power series ean be determined in terms of the function 
to which the series converges. The series in (12.48) is the Taylor series, 
or Taylor expansion, of f in powers of (x — a), and our work shows that 
each convergent power series is the Taylor series of the function to which it 
converges. In case a = Q, the Taylor series is sometimes called a Mac- 
laurin series, but the practice has little justification and is being slowly 
abandoned. 

The following uniqueness theorem is used very often. 

Theorem 12.481 If r > 0 and if the two power series Èbi(x — a)* and 
Zcr(x — a)* both converge to the same f(x) when |x — al < r, so that 


(12.482) f(x) = bo + bi(x — a) + bx — a} +- (jxe-a <n) 
(12.483) f(x) = co + ci(x — a) + cole — a)? + °°: (jx — al < r), 


then by = cr for each k. 
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For example, the series in (12.412), (12.413), and (12.414) are the only 
power series in x that converge to ¢7, cos x, and sin x. To prove this 
theorem, we can start with (12.482) and show that b = f®(a)/k! just 
as we started with (12.43) and showed that cp = f®(a)/k!. 


Problems 12.49 

1 Learn the formulas 

a) esitet tA 
O) sins =e- 4th 
(6) cose =1- +AA 
@) ltet 


Write the four formulas obtained by differentiating formulas (a) to (d). 
2 Explain the steps by which the series 


sitie +e+ (l < 1) 


and modifications of it can be used to obtain the formulas 


1 
(a) ggg ortet <D 
2 3 4 
() log +x)=e-F4+5-Fto-- (<D 
1 
() 7p tlm tata ([x| < 1) 


x5 


3 7 
(d) mese- g+y 7+ (lx] < 1) 


(2) = er ET EEEE EEE . (|x| < 1) 
(f) lim log (1 + x) lt x) 1, lim x — log 1%) 1 
z—>0 x 2 
Jog (+: 2 3 4 
© f 04), Hye Hy. (lel < 1) 


3 We can object to the general principle that problems should be solved in 
inefficient ways, but nevertheless we can sometimes profitably sacrifice a few 
square feet of paper to promote understanding of a subject. Assuming that the 
series in 

sin 2x = co + cix + Cox? + cyx t etto 


converges to sin 2x, find co, C1, ĉa ° > > in the following way. Put x =Q to 
find co. Differentiate once and put x = 0 to find cı. Differentiate once more 
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and put x = 0 to find cə} Continue the process until c; has been obtained. 
Finally, see whether the result agrees (as it should) with the result of replacing x 
by 2x in the basic formula 
. xS x5 x 
sinx =x z tyy t 
4 Repeat the operation of Problem 3 to find the expansion of eè? in powers 
of x, it being assumed that b > 0 and that there is a power series in x that con- 
verges to 7. Tell how your answer can be checked. 
5 Assuming that there exist constants co, ¢1, Ca © -© for which 


f(x) = co + ci(x — a) + cx — a)? + cess — atoe, 

do enough differentiating and substituting to find the first few c’s when 

(a) f(x) = 1 — x)! a =0 

(b) f(x) = sta =l 

(c) f(x) = log (1 + x),a = 0 

(d) f(x) = logx,a=1 

6 Without bothering to write derivatives of the right member of the formula 

f(x) = co + e1(% — a) + calx — a)? + 03(x — a) +e, 


suppose that the series converges to f(x) and find the first few of the c’s with the 
aid of the formula 


f"@ f(a) 


RA aa) + aat EO — aH mt 


fle) = fla) +H 
when 

(a) f(x) = sin x, a =0 
(b) fx) = (1 — x) t a=0 

(c) fx) = 1 +x), a=0 

(d f(x) =x? — 2x2 +x- l,a=1 


7 Itis possible to apply the methods of the preceding problems to calculate 
a few coefficients in cases where the complexities of formulas for f™ (x) increase 
very rapidly as n increases. In some such cases, it is worthwhile to know the 
numerical values of the first one or two or three nonzero coefficients. Verify 
the first two nonzero coefficients in each of the formulas 


(a) tan x = x + gx? + fsx’ + gee? + rges? t+: («1 <5) 


(b) sec x = 1 + $x? + arri t+ fei + poe? t+ e (<3) 
(c) 1tetx2)¥ RET EE 
8 It can be proved that if the series in the first two formulas 


(1) f(x) = ao + ax + ax? + age? + att ets 
(2) g(x) = bo + bix + bax? + bax? + baxt t > 
(3) f(x)g(%) = co + cix + cox? + cox? + cert to: 
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converge to f(x) and g(x) when |x| < r, then the series in (3) will converge to 


f(x)g(x) when |x| < r provided the constants co, ¢1, ¢z, * © > are determined by 
the formulas 


Co = obo 
cı = dob; + aibo 
c2 = aob: + aibi + aabo 


c3 = aob3 + aib: + a21 + aabo 


etcetera. Observe that this is precisely the way we would write the product of 
the right members of (1) and (2) if they were polynomials. To obtain a bit of 
experience with these formulas, write the formulas to which (1), (2), and (3) 
reduce when ap = b = 1 for each k. Check your work by obtaining the third 
formula from the first in another way. 


9 Prove that if the series up + uı + uz + °° has partial sums so, 51, 59, 
- and if the series in 


f(x) = uo + uix + ux? + ua? +: 
converges to f(x) when |x| < 1, then 
f(x) = (1 — x) (so + six + sox? + sax? + ° + *) 


when |x| <1. Hint: Use the information given at the start of the preceding 
problem, putting bę = 1 for each k. 
10 Write two more terms of each of the series 


2 3 
e=- at 


pal rte aH 
e = 5x? 
ee ee e. 


11 It can be proved that if the series in the first of the formulas 


(1) f(x) = ao + ayx + ag? + aax? + axt t >o 
1 
(2) Fe) T Pot bie H bax? + bax? + bax’ t * 


converges to f(x) when |x| < rı and if ao Æ 0, then there exist numbers r2, bo, 
bi, b» * - - such that the series in (2) converges to 1/f(x) when |x| < re. Since 
the product of the left members of (1) and (2) has the power series expansion 


1 + Ox + Ox? + Ox? + 0x4 +, 


the coefficients bo, bi, ba > - -© can be calculated from the formulas 


1 aobo 

0 aod + aibo 

O = aob: + abı + abo 

0 = aob + aiba + abı + abo- 
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Use this idea and the known power series expansion of cos x to obtain some of 
the coefficients in the expansion 


seca = Lt dat + airet H yc + Biles pe 


12 We can be agreeably surprised by the simplicity of the operations which 
determine the first three or four of the b’s in the formulas 


sin x x? xit x 
| l-g tj at: 


x 
x 


= bo + bax? + baxt + dex + --- 


sin x 
and yield the formula 


7x3 31x5 127 


l x 
scx =y tatag tio t oo t 


which is valid when 0 < |x| < x. 


13 Start with the power series expansion of e? and use it to obtain the formula 


e — 1 
x 


x x7 o x4 
tata tatat- 


Find the formula obtained by equating the derivatives of the members of this 
formula and putting x = lin the result. Ans.: 


1 2 3 4 
=a tat gtyto. 
14 Determine the first six of the coefficients in the formula 
1 
Topix gp eTa tart aw + ay + os, 


Hint: Start by writing 


I = a + ax + ax? + ax + atte 
+ aw + ax? + ax? + agt+--- 
+ ax? + ax? + aox* + eoe 


and obtaining the formulas in the first column 


1 = a a = 1 
0 = ap + a a = —|I 
O=a tata, as = Ü 
0 = ai + a: + a; az= 1 


which determine the answers in the second column. 
15 Obtain the given coefficients and two more coefficients in the formula 


1 + 2 


l-x— 


atl t3xt a 7 t+: 
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Series 
Hint: Start by writing 
1+2x 
——— = a ax ax? azx? coe 
l] =x- x2 0 + 1 + 2 + 3 + 
and 
1 + 2% = ao + aye + ax? + ag? + agit -- > 
Then obtain the formulas in the first column 
1 = ao a=1 
0 = as — a; — a a= 4 
0 =7 


which determine the answers in the second column. 
16 The sequences 


(1) 1,1,2,3, 5, 8,13, 2l, >>- 
(2) 1, 3, 4, 7, 11, 18, 29, 47, -- - 


are examples of Fibonacci sequences, that is, sequences for which each element after 
the first two is the sum of its two nearest predecessors. A little work with power 
series reveals some surprising information about these famous sequences. Let 


Fo, Fi, Fa, + + + be a Fibonacci sequence. Letting g be defined by the first of 
the formulas 

(3) g(x) = Fo + Fix + Foe? + Foe? + Put oo 

(4) xg(x) = Fox + Fix? + Fox? + Feet t+: 

(3) x?g(x) = Fox? + Fix? + Fath e, 


tell how the next two are obtained. Subtract the last two formulas from the 
first and use the result to show that 


(6) g) = Ee 


1—x* — x? 


New and illuminating formulas are obtained by expressing g(x) as a sum of par- 
tial fractions and expanding these fractions into power series. To simplify 


writing, let 
-1 
(7) A= Veal = 0.618034, B= vers = 1.618034. 


Observe that 4B = 1. Show (the details are a bit onerous) that 


1 1 B A 
(8) rA Alota] 


and hence that, when |Bx| < 1, 


1 ç 1l 
Q —_ m — [Br 4 (—1)n 4r+]yn 
(9) l] — x — x? Pv (—1)r ArH]. 
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For the case in which Fy = F; = 1 as in (1), the formulas (3) and (6) and (9) 
show that 


(10) Fn = Wa [Bett + (—1)” 4+]. 


Show that, when n > 1, 


F, Both (—1) 4"> B + (—1)*4"+1/B” 


(11) Fi B- (194 “T Cipele 
and hence that 
(12) lim #2 = g= Vt 

n> k n—l 2 


Remark: The function g in (3) is called the generating function of the sequence 
Fo, Fi, F2, © © * . The things which we have done are of interest in many branches 
of mathematics and can be extended in many ways. 

17 Suggest a few ways in which the expansion 


2x? = 28x4 25x6 27x8 
sint = 97 — gr ber — er 


can be obtained. Remark: Perhaps the simplest way involves the identities 


2 A 6 
sin? z = $ — bose =p (1 -EE 4 E -2 . -X 


18 Supposing that x > 1, obtain the formula 


1 
tanix = 5 — lyh za t za 


in two different ways. First, use the formula tan™! x = 7/2 — tan”! (1/x) and 
then use a modification of part (d) of Problem 2. Secondly, start with the 


identity 
aiibi 


and integrate over the interval t 2 x. 
19 We often use the fact that the elementary expression for the left member 
of the formula 


a) fiu du = 82 4 EH Mog! | Ce 


+E + ay eg x)‘ pee, 


in which x > 0, has one form when s Æ —1 and has another form when s = —1. 
Prove that (1) is correct for each s. Hint: For the case in which s # —1, 
evaluate the integral and expand the result into a series with the aid of the fact 
that x#t! = ¢@+D lez, Treat the case in which s = —1 separately. 
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20 Without pretending to give a reasonable introduction to complex numbers, 
we take a hasty look at some remarkable formulas that involve these numbers. 
Let i, the so-called imaginary unit, be a number for which 2? = —1. It is possi- 
ble to set forth rules for operating with complex numbers of the form x + iy, 
where x and y are real. For example, if z = x + ty, then 


gi = (x + iy)? = x? + 2ixy + iy? = (x? — y?) + 2ixy. 


A complex number x + iy for which x and y are real and y = 0 is identified 
with the real number x, so the set of real numbers is a subset of the set of complex 
numbers. The series in the right members of the formulas 


(1) =1424+343 ttit 
2 4 6 

(2) ener eee 
3 5 7 

(3) sing = 2 — 3; zi = + 


are power series in z. With the agreement that a sequence xı + iyı, x2 + iy, 


- of complex numbers converges to Lı + iLeif lim x, = L,and lim yn = Lo, 
N—> O N> © 


it is possible to construct a theory of series of complex numbers. It can then be 
shown that, for each z, the series in (1), (2), and (3) converge to complex num- 
bers which are real only in special cases and which can be denoted by e7, cos z, 
and sin z even when they are not real. Thus é’, cos z, and sin z are defined for 
each complex z and the formulas (1), (2), (3) are valid foreach z. If w is a com- 
plex number, which may be real but is not necessarily so, we can put z = ww 
in (1) to obtain 


oe = 1 40+ Sh 404 104. 
j +0 +iw +0 4+ +o 4+ or 4 
Since i = —1, 3 = —i, 14 = 1, 44 =i, 18 = —1,°--, this formula can be 
put in the form 
6) wa(1-S 4H 94... )ti(w-F+F-F4---) 


Since (2) and (3) show that the series in parentheses converge to cos w and sin s, 
we obtain the first of the four Euler formulas 


(6) ge = cos w + isin w 
(7) ge = cos w — i sin w 
ew + e7 tw . ew — giu 
(8) cos w = —— s~; sin w = —~>——" 
2 21 
Since (2) and (3) show that cos (—w) = cos w and sin (—w) = — sin w, we 


can replace w by —w in (6) to obtain (7). Adding and subtracting (6) and (7) 
enables us to solve for cos w and sin w to obtain (8). Thus we have proved the 
four Euler formulas. These formulas are sometimes said to be the most remark- 
able formulas in mathematics. They have very many important applications, 
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and it is worthwhile to be able to start with a clean sheet of paper and write all 
of the formulas needed to derive them. 


21 If |a|!” < Ri for each sufficiently great n, prove that Za,x" converges 
when |x| < 1/Rı. Solution: When |a,|"" < Rj, we find that 
lan[/"|x| < |xRil, [anx™| £ |R|”. 
The hypothesis that |x] < 1/Rı implies that |xR;| < 1, so 2|xR,|" < æ and con- 
vergence of 2a,x" follows from the comparison test. 
22 If R:> 0 and ja,|'" > R, for an infinite set of values of n, prove that 
Zax” diverges when |x| = 1/Ra. Solution: When janl!” > Ro, we find that 
lan|"7|x| 2 Rex], — aax™| Z (Relx|)”. 
The hypothesis that |x| 2 1/Re implies that Re|x| = 1. Hence Ja,x"| = 1 for 
an infinite set of values of n. Thus it cannot be true that 


lim apx” = 
n— oo 


) 


and therefore 2a,x" must be divergent. 


23 Prove the following famous theorem, which is known as the Abel power 
series theorem. If the series in 


(1) S= a ta+atat::: 
converges to s, then the series in 
(2) Fr) = ao + ar + aor? + ag? + --- 
converges when Ò < r < land 
(3) lim f(r) =s. 

r>1l— 
Solution: Let sn = ao + ai + + + +> + an, so that 
(4) lim sn = 5. 

n— 0 


There must be a constant M such that |a,| S M and |s,| S M for each k. When 
|r| < 1, the series in (2) and the series in 


(5) Flr) = so + (s1 — 50)7 + (s2 — 5)? + (sg — sAr + 
are therefore both convergent and they both converge to the same number f(r) 
because 59 = ao, 51 — 59 = 4], Sp — 51 = 4}, °° * . Because the separate series 
are both convergent, it follows from (5) that 

(6) f(r) = (So + sir + sor? + + + *) — OF sor + ir? + + + +) 

and hence that 

(7) f(r) = (1 — r) Go + sir + ser? + oeoa. 

But 

(8) s= (l-rs +s +s+: 

and hence 


O fr) -s=(1—-7) » (se — s). 
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Let e > 0. Choose an integer N such that |s — s| < €/2 when n >N and 
N 


let C = X, |s — s|. Then, when0 <7 <1, 


k=0 
(10) lf) —s| SG —)CH+A—r) YY Ise — slot 
k=N+1 
<(l—7rnc+(1-—7) D 5” 
k=N+1 
<(l-ne+ 


If we choose ô such that (1 — r)C < €/2 when 1 — 6 <r < 1, we will have 
(11) f(r) — s| <e (l—8<r<1). 
This gives (3) and the theorem is proved. 

24 ‘Tell why the Abel theorem of the preceding problem implies that 
+ . o 


lim x 
gzl- 


“pt pe “l-t 


x? , x3 xt 1 1 1 
+) p p p 


and hence that part (g) of Problem 2 implies that 


l 1 1 1 
fredt+% yer E S ES 

25 Some of the most honorable parts of mathematics involve connections 
between the Riemann zeta function and prime numbers. Deriving a basic 
formula is a good exercise for us. Let p1, po, p3, © © © denote in order the prime 
numbers 2, 3, 5, 7, 11,13, - - +. Euclid proved that the set of primes is infinite. 
We use the fundamental theorem of arithmetic which says that if n is an integer 
for which n > 1 and if bx is the greatest prime factor of n, then n is uniquely 
representable in the form 


n = DDA 7 2 œ DK *, 


where the exponents Ai, Az, * * * , Ay are nonnegative integers. For example, 
504 = 2332507}, Show that, when Å is a positive integer and s > 1, 


1 1 1 1 
(1) —jzHltatgetetoc: 
1- 
28 
1 1 1 1 
(2) —j = !+RtEtR+ SC 
I- = 
38 
1 1 1 1 
©) att tetas tit 
5s 
1 1 1 1 
(4) —, =l +44; t-z tzt... 
1-4 Pi PEO ODE 
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Making wholesale use of the rule of Problem 8 for multiplication of series, show 
that 


"Á e e Ò ne e ~» l 


where the star on the sigma means that some of the terms for which n > pp 
are omitted from the series. Prove the formula 


6 s 1. 5 loş l. 
(6) APSA mS D pns 
and use it to prove that 
. 1 1 1 1 
(7) lim ——~- — — --- — = f(s). 
kop Lyd _1 TE 
Pi $2 P3 Pk 
Use (5) to obtain the inequality 
(8) 2i _ 2 2 a gl 
pi — 193-1 n on 


and show that taking the limit as s > 1+ gives the inequality 


Pi-—lpe-l fe -1 = 


(9) Pi b Pe > y + log Dy 


where y is the Euler constant. Show that neglecting the y and taking logarithms 
gives 


~ 1 
(10) DA log (1 + sai) > log log Dn. 
Show that x > log (1 + x) when x > 0 and hence that 
n 1 
(11) 2 1 > log log pr 
and 
(12) ) 1 > l log log Dn. 
yay Pe” 2 


Show finally that 
(13) —+—-+—-+ °°: =o. 


26 The theory of functions of a complex variable provides an elegant proof 
of the fact that if 
(1) f(x) = co + clx — a) + eo(x — a)? + ese — a)? + °° 
when |x| < R, then 
(2) f'(x) = c + 2ea(x — a) + 3es(x — a} + °° 


when |x| < R. Persons willing to go fishing even when no fish are caught can 
try to prove the result with rudimentary equipment. To simplify writing, 
leta =0. Then, when |xo| < Rand |x| < R, 


je) = eat = tim $ a 
k=0 noe k=0 
flo) = y chee = lim D Chib 
k=0 ">> k=0 
SO 
fle) — leo) = lim J, calet — À) 
n>% k=l 
and 
f(x) — f(x) = lim [c + D C(x! + sot oeo +b x0”) |. 
x — X0 n—> © k=2 


If we can prove that the limits exist, we can take limits as x approaches xo to 
obtain 


f'(x) = lim lim [c + >, cplx! + x 2x9 + - + x47?) |. 


Z> Zo N V 


If we can prove that the same result is obtained by interchanging the order in 
which limits are tahen, we obtain 


f (xo) 


Ii 


lim lim [c -+ D cpl! + xt eee + s51) | 


n> o TTo 


and hence 
pee) = im [at Stott] = Š teat 
N— © k=? Bni 


Our fishing expedition can be ended with the remark that we ran into questions 
involving iterated limits and change of order of limits that swamped us. 


12.5 Taylor formulas with remainders In Section 12.4, we started 
with given convergent power series and found that these series are the 
Taylor series of the functions to which they converge. In this section 
we Start with a given function f and study the general aspects and further 
applications of a method we have previously employed in special cases 
to obtain power series expansions of ¢7, cos x, and sin x. We suppose 


that a and x are confined to an interval over which f has all of the con- 
tinuous derivatives we want to use. Then 


(12.51) fe) = fla + [PFO ae. 
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Integrating by parts with 


u = f(t) dv = dt 
du = f'’(t) dt, v = —(x — t) 
gives 
fe) =K +O- ot rotna 
or 


(12.52) fle) = f(a) +9 @ — a) + f poa 
Another integration by parts with 


u = f" (À dy = e 9 dt 


du = fO) dp, v= EP 
gives 
fe) =f) +P e-d R e-a [py ESP a 


One more integration by parts gives the result of putting n = 3 in the 
formula 


12.53) f(x) = fla) +R a- a) +O we — a) 
+ |. +o (x = a)” + R,(x), 
where 


(12.54) R,(x) = + J 7 fot (2) (xe — 4)" di, 


and further integrations by parts give the results when n = 4, 5, 6, . 
The formula (12.53) is a Taylor formula with remainder R,(x). The “ight 
member of (12.54) is the integral form of the remainder. In some cases, 
(12.54) and other remainder formulas can be used to determine values of 
x for which 


(12.55) lim R(x) = 0. 
For such values of x the Taylor formula 


12.56) fe) =e) +E @ — a) +O we — arg. 
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is valid. The right member of (12.56) is called the Taylor expansion of f 
in powers of (x — a). Problem 5 gives an example of a function f whose 
Taylor expansion in powers of x exists and converges for each x to a 
number which differs from f(x) when x # 0. For this and other reasons, 
it is sometimes necessary to use (12.54) and other remainder formulas 
to obtain numerical estimates of |R,(x)|. 

The fact that (x — t)” is either always positive or always negative 
when ¢ lies between a and x enables us to use (12.54) to obtain other 
formulas for R,(x) that are sometimes, but by no means always, more 
easily used than (12.54) itself. The simplest and most widely used of 
these formulas is obtained by the observation that the value of R,(x) 
lies between the numbers obtained by replacing the factor f™+)(¢) by 
its Minimum and maximum values over the interval from atox. Hence 
the intermediate-value theorem implies existence of a number x* between 
a and x such that 


R(x) = TERE — t)" dt 
and hence 


— ford (x*) — nti 
(12.57) R(x) = (n+ 1! (x — ajm, 
This is the Lagrange form of the remainder. 
The binomial formula 


(12.58) (1 +a) = 1 +f MOD eg UDUD a 


—1)(¢ — 2)(q — 3 — 1)(q — 2)(q — 3)(q — 4 
+ WAVE D9) ay a DUDUA a 
+.. ; 


in which the exponent g is not necessarily an integer and |x| < 1, is used 
so often that many persons find it worthwhile to remember the rule by 
which the coefficients can be written. To prove the formula, we let 
f(x) = (1 + x)! and calculate the derivatives 


f) = gA t m, f = gl A H e, 
f”) = dq — 1) — 21 + oe, 
f(x) = gg -Iy-2)--- lnt DA H a. 


Since f(0) = 1, f’(0) = q, f’(0) = ¢(¢ — 1), - - - , the series in (12.58) 
is indeed the Taylor expansion of f in powers of x. To prove that the 
series converges when |x| < 1, we can apply the ratio test, but to prove 
that the series converges to (1 + x)2, we must show that lim R,(x) = 0. 
While we could (without being completely unfashionable) find that the 
Lagrange form of the remainder will work when 0 < x <1 and that 
another special form will work when —1 < x < 0, we shun these things 
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and use the integral form (12.54) to obtain 


(¢-—1)¢—-—2)---(g- 
(12.581) Rala) = DUI = 9) 


z E x — t\" 
[atom (GRY a 


The function ¢ for which $(t) = (x — #)/(1 + 2) is monotone over the 
interval from 0 to x and (x) = 0, so |¢(z)| must attain its maximum 
over the interval from 0 tox whent = 0. This maximum is therefore 
|x|. Hence 


(12.582) |R (| < a — Dg ~ 2) @ ~ 9) 


T 
Pi f (1+ 2) dt . 
| 0 i 
In case g is a nonnegative integer or x = 0, it is easy to see that 


(12.583) lim R(x) = 0, 

because R,(x) = 0 for each sufficiently great n. When g is not a non- 
negative integer and 0 < |x| < 1, an application of the ratio test gives 
(12.583). This establishes the binomial formula (12.58) for the case 
in which |x| < 1. 


Problems 12.59 


I With the aid of Taylor formulas with remainders, obtain the expansions 
of f in powers of x — a when 


(2) f(x) = ea =0 O) fe) = 7a =1 
(c) f(x) = sin x, a =0 (4) fe) = sinxa =F 
(e) f(x) = cos x, a = 0 O) f(x) = cosx, a =F 


2 Supposing that |x| < 1, write two more terms in each series appearing in 
the calculations 


ft 1 
ite = | ae 
GQ — a =14 22 (Hw + OED 
1 1-2 
=1+irti et 
z -3 
ste f [+e] 


1-3 1-3-5 
sinta =a + gga? H aag at + ppp tt coe, 
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3 Write two more terms of each series appearing in the calculations 


K J P l 
~ Jo I — k sin? 6 
(—2)(—=2) ie 3) 


(1 — x)% 148 x) + (xF + 


paker 
z/2 
K= Í. [its k? sin? @ + 5-4 kf sin’ G+: . -|a 


in which O < k < 1. Use of the formula 


x/2 1-3-5 - - - (2p — 1) ar _ 
f sin? ĝ dô = 7-4-6 - -- (2p) 7 (p = 1,2,3, ) 


then gives 


12-32-57 
K=3(i+% B+ a tegak t o) 
4 Supposing that 0 < k < 1, let 


E= [vi —# sin? 6 dô 


and modify the work of the preceding problem to obtain 


T 12-3 12-32-5 
E=5(1\- pH -ppt prat) 


It is not easy to know everything, and we may be unable to say whether our 


formula for E is valid when k = 1. Show that if it is, then 


12-3 12-32-5 12-32-52-7 
ptoept page TREET =l- 


5 While this may not be an appropriate time to enter into details of proofs, 
the function f for which f(0) = 0 and 


f(x) = eu? (x = 0) 


has continuous derivatives of all orders over the whole infinite interval — œ < 
x < æ. Moreover, f(0) = 0 for each k = 1, 2, 3,---. In this case the 
Taylor formula (12.53) with a = 0 becomes 


fe) =OF04 +--+ +04 R(x). 
The Taylor expansion of f in powers of x is therefore the series 
0+0+04+::- 


which converges for each x but converges to f(x) only when x = 0. 
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6 Little things like the formula 


ay a) = 40) +P a - 0 4+ 2M a - op 


(n) (0 (n+1) (¢* 

too FE a — 0 4 OO) a - oe 
and the capacitors that appear in electrical networks have surprising applications. 
As we shall see, simple applications of (1) give Taylor formulas for functions of 
“several variables,” ‘several? meaning more than one. Extensions to functions 
of more variables being easily made, we suppose that G is a function of two vari- 
ables x and y. We suppose that (xo,yo) and (x,y) are interior points of some 
convex region, a circular disk, for example, over which G is continuous and has 


all of the continuous partial derivatives we want to use. Supposing finally that 
0s: 81, let 


(2) fE) = G(xo + t(x — xo), yo + tly — 40). 


Then f(1) = G(x,y) and f(0) = G(xo,v0) and we can start production of the Taylor 
formulas. Differentiating (2) with the aid of the chain rule of Theorem 11.23 
gives 


(3) rO = Ga(x%o + t(x — xo), vo + tly — Yo))(% — xo) 
+ Gy(x%o + t(x — xo), yo + tly — yo))(y — y0). 


We can use (1) with n = 0 and obtain the primitive but nevertheless useful 
Taylor formula 


(4) G(x,y) = G(xoy0) + Ge(x*,y*) (œ — x0) + Gil*,y*) 0 — y0), 

where 

(5) x* = xo + t*(x — xo), Y* = yo + ty — Yo). 

To prepare for more elaborate Taylor formulas, we put t = 0 in (3) to obtain 
(6) F'O) = Ga(%oyo)(* — x0) + Gy%on¥0)(y — y0) 


and differentiate (3) with the aid of the chain rule to obtain 


(7) F(E) = Gaalxo + t(x — xo), yo + ily — yo))(% — 20)? 
+ Gay(xo + t(x — xo), Yo + tly — yo))(* — x0)(y — Yo) 
+ Gayo + t(x — xo), yo + t(y — y) — Yo) — xo) 
+ Gyy(%o + t(x — xo), yo + ty — 30))(Y — Yo)” 


We now have the material required to use (1) with n = 1. It is easy to continue 
the procedure, but the expressions we write will become more and more pon- 
derous unless we introduce simplifying notation. We begin by abbreviating 
(3) to the form 


ð 
6) fo =[@-wet o- 5,6 


and observing that, on account of the equality of the mixed partial derivatives, 
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(7) can be put in the form 
0 ð 7% 
(9) Pa =[@ - 2) +O - wz] C 


when k= 2. It turns out that Taylor formulas with remainders are obtained 
by substituting (9) into (1), the partial derivatives being evaluated at (xo,y0) 
when k S n and at (x*,y*) when k =n + 1. 

7 LetG bea function having continuous partial derivatives of first and second 
orders over a neighborhood of (xo,yo) in which (x,y) is supposed to lie. The 
Taylor formula of Problem 6 which terminates with second derivatives then takes 
the form 


(1) G9) = G@o,yo) + Gz(x0,y0)(* — xo) + Guloy) — yo) 
+ a[Gee(x*,y*)(% — x0)? + 2Gay(x*,y*)(@ — xo) (y — yo) + Gyy(x*,9*)(y — 0), 


where x* lies between xp and x and y* lies between yo and y except that x* = x, 
when x = xo and y* = yo when y = yo. This formula is useful. For example, 
it provides an easy way of estimating the difference between G(x,y) and G(x0,0) 
that is especially useful when |x — xo| and |y — yo| are so small that the last term 
is negligible in comparison to the two preceding terms. In particular, (1) gives 
us a chance to estimate the magnitude of the error involved when the number 


dz defined by 


(2) dz = Gz(%o,yo) (x — xo) + Gy(%o,¥0)(y — Yo) 
is taken as an approximation to the number Az defined by 
(3) Az = G(x,y) — G(%o,¥0)- 


It is quite possible to spend a few days solving problems involving these ideas, 
and the investment of time might even be a reasonably good one. We invest a 
few minutes to study extrema (local and global minima and maxima) of G. If 
G(x,y) has an extremum at (xo,yo), then G(x,yo) must have an extremum at xo 
and hence Gr(xo,yo) = 0. Similarly, if G(x,y) has an extremum at (x0,yo), then 
G(xo,y) must have an extremum at yo and hence G,(x0,yo) = 0. Thus G cannot 
have an extremum at (%o,yo) unless 


(4) Gz(x0,¥0) = Gy (o,Yo) = 0. 


To investigate the question whether G has an extremum at a point (xo,yo) for 
which (4) holds, we put (1) in the form 


(5) Gy) — Glxoyo) = ZI(4 + €1)h? + 2(B + €2)hk + (C + €3)k?], 
where 
(6) A= Gzz(%0,Yo); B= Gry (%0,¥0), C = Gyy(%0,¥0); 
h = x — xo k = y — yo, and the numbers €), €2, €s depend upon h and k and are 
small when |4| and |k| are small. In case 4 0 we can, when |A| and |&| are 
small enough to make 4 + e, # 0, put (5) in the form 
l 

(7) G(x,9) — G@oyo) = A+ a) {[(4 + 61)h + (B + €2)k]? 

+ [(4 + @)(C + e) — (B + €2)°]k?}. 
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In case AC — B? > 0, the quantity in braces will be nonnegative whenever 1 
and |&| are small enough to make 


(8) (4 + 41)(C + €3) — (B+ €)? > 0. 
It follows that if (4) holds and 
(9) Gzz(x0,yo) > 0, Gzz(%0,¥0)Gyy (%0,0) — [Gz,y@o,yo)]? > 0, 


then G must have a minimum at (xo,yo) because in this case 4 + e; must have the 
same sign as 4 and the right member of (7) must be nonnegative when |A| and 
|k| are sufficiently small. Similarly if (4) holds and 

(10) Grz(*osyo) <0, — Gez(0,¥0)Gyy(%o,¥0) — [Gey(%o,s'0)]? > 0, 


then G must have a maximum at (xo,yo) because in this case the right member of 


(7) must be nonpositive when |4| and |&| are sufficiently small. Jn case (4) 
holds and AC — B* < 0, that is, 


(11) Gaz (%0,¥0)Gyy(%0,¥0) — [Gey (x0,yo)]? < 0, 


the function G cannot have an extremum at (xo,Vyo). We omit proof of this fact, 
and we also omit discussion of the way in which Taylor formulas having more 
terms can (when the required derivatives exist) be used to discuss cases in which 


AC — B = 0. 


8 Supposing that a = 0 and b? — 4ac =Æ 0, find the extrema of the function 
G for which 


(1) G(x,y) = ax? + bxy + ey?. 
Remark: When solving problems of this nature, it is usually safe to set 
z = ax? + bxy + cy? 


and use the curly dee notation for partial derivatives. Outline of solution: The 


system of equations st = Q, z = Q0 is satisfied only when x = y = 0. The 
formulas 

ð?z 07z ð?z ðs N > 
(2) z2 = 2a, Ox Oy? — 3 ax) = ac — b 


and the italicized statements of Problem 7 show that z has a minimum at (0,0) 
if a > 0 and 4ac — b? > 0, that z has a maximum at (0,0) if a < O and 4ac — bo 
> 0, and that z has neither a maximum nor a minimum at (0,0) if 4ac — b? < 0. 
Remark: The examples 


(3) VA = x? + y}, g = —x? — y?, = x? — y? 
illustrate the three phenomena. The examples 


illustrate the cases excluded from this problem by the condition b? — 4ac = 0. 
9 Supposing that a = 0, determine the points (x,y), if any, at which the 
function G defined by 
G(x,y) = x? — 3axy + y? 
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takes extreme values. Ans.: The critical points where the first-order partial 
derivatives both vanish are (0,0) and (a,a2). At (0,0), G has neither a maximum 


nora minimum. At (a,a), G has a minimum if a > 0 anda maximum ifa <0 
Moreover, G(a,a) = —a?. 
10 Assuming that 


G(x,y) = ao + bix + doy + cix? + coxy + egy? 
+ dix? + dox*y + dzxy? + dax? + ext + > 
over some neighborhood of the origin, and that the series can be differentiated 


termwise with respect to x and y as often as desired, determine enough coefficients 
to verify three or four terms in the expansion 


G(x,y) = G(0,0) + [G.(0,0)x + G,(0,0)s] 
+ z [Gz2(0,0)x? + 2Gz,(0,0)xy + Gyy(0,0)¥7] 


1 
+ 31 [Gz22(0,0)x + 3Gz2y(0,0)x*y + 3Gryy(0,0) xy? + Gyyy (0,0) y3] tree, 


J1 Suggest two or more ways to obtain the power series in x and y which 
converges to ¢7t¥ and use each method to obtain a few of the terms. 


12.6 Euler-Maclaurin summation formulast As we shall see, we 
need only one very simple idea to obtain some remarkably useful and 
important formulas involving the Bernoulli functions and numbers. 
While the index reveals locations of more information about Bernoulli 
functions and numbers, we start with the facts that 


(12.611) Bo(x) = 1 

(12.612) Bi(x) = Br_1(x) (n = 1,2,3, - + -) 
(12.613) Jy Bale) dx = 0 (n = 1,2,3, > +») 
(12.614) B,(x + 1) = B,(x) (n = 0,1,2, - + -) 


over the interval — © < x < œ, except that (12.612) fails to hold when 
nis l or 2 and x is an integer. The function B,(x) is the saw-tooth func- 
tion having the graph shown in Figure 12.62. The Bernoulli numbers 


Figure 12.62 


f This section can be omitted. It is not claimed that the section is easy. It is not 
claimed that the material can be thoroughly digested in a day or a week ora year. It is, 
however, claimed that students of calculus should see a substantial and useful application 


of calculus. Even though we build only more modest structures in examinations in this 
course, we should see at least one cathedral. 
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B, are defined by the formulas 


(12.63) B, = n\B,(0) (n 2 2) 
and 
(12.631) Bo = 1, Bı = —4, = 4, B, = 0, B, = -75 B; = 0, 


Be = iy, Br = 0, Be —z5, By = 0, Bio = 3%, 


Without yet knowing what is going to happen, let p and g be integers 
and let f be continuous and have all of the derivatives we want to use 


over the interval pS x S q. Letting k be an integer for which p S k 
< q, we start with the simple idea that 


k+1 k+1 
(12.64) I, fle) dx = f°" fe) Bolx) dx 
and that we can modify the right side by integrating by parts with the 


aid of (12.612). However, since B(x) is discontinuous at k and k + 1, 
we must be careful. Accordingly, we put (12.64) in the form 


(12.65) S 1@ ax = im f, EHIE 60) Bo(x) dx. 


Setting 
u = f(x) dv = Bo(x) dx 
du = f'(x) dx, v = B(x) 
gives 


(12.651) MEO) dx = hm 1 SOB |e vies 
— A BLOLAO dæ} 


and hence 
(12.652) Í ETE ton) dx -IA tt — ii fe) Bulx) dx. 


Adding the members of (12.652) for integer values of k for which $ S 
k < q — 1 gives the more useful identity 


q q 
(12.653) l tla) de = Y f) _ fe) + fO — Jl P )Bilx) dx. 
Pp k=p Dp 
Transposing gives the basic Euler-Maclaurin formula 


aso ¥ sa = [pe ax + OTM + [TOB 
k=p Pp Pp 
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which is used to estimate sums. More Euler-Maclaurin formulas 
(12.661) È fE) = Í f dx + PEIO 5 LO FO 

— POBRE dx, 
(12.662) D f(k) = Í ° f(x) dx +f ho 4 ED = (2) 
+ [P OB dx, 


etcetera, are easily obtained by further integrations by parts. The 
formula obtained after m integrations by parts is 


" — | tx) de 4 SO) TIO 
(12.663) x A) = Í f(x) dx + = 


4 È [F9 (g) — IOE 4+ (—1)™41 Í ? f™ (x) B(x) dx. 


This formula reduces to (12.66), (12.661), and (12.662) when m is 1, 2, 
and 3. We must observe that B, = 0 when7is odd andj = 3; otherwise, 
some of the signs in (12.663) would be wrong. 

In some important applications, f™(x)— 0 and as x —> © and the 


integral in 


12.664) C, = LP) — Š pogi t (onm f7 ABa) a 
722 J: p 


exists when m is sufficiently great, say m = mọ Z 1. In such cases, 


we can define the constant C, by the formula (12.664), the right member 
being independent of m because integration by parts shows that it is 


unchanged when m is replaced by m+ 1. Subtracting (12.664) from 
(12.663) gives the formula 


q 
(12.665) Ý fle) = C+ | fe dx + 
k=p P 
m ; B, o 
+ $ a at + CD” l f™ (x)Bnlx) dx. 
3=2 ° q 
Solving this formula for C, gives the formula 


(12.666) C, = ) f(k) — f r dx - 19) 
k=p P 


— », FEO) 3 — (—1)" Í > f™®(x)Bm(x) dx, 


which is sometimes used to calculate approximations to Cp. 
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In many practical applications, the values of the integrals 
q o 
(12.67) J IOB) de, fÀ f(x) Bale) dx 


are not known, but this makes no difference because their algebraic signs 
can be determined and we discover that numbers we want to calculate 
lie between known numbers that are extremely close together. For 
example, the natures of B;(x) and Bs(x) are such that if f(x) and f(x) 
are positive and decreasing, then the integrals in (12.67) are positive 
when m = 3 and are negative when m = 5. 

The first problems at the end of this section give some of the simple 
applications of the Euler-Maclaurin summation formulas. In Problem 
4 we shall derive some very important Stirling formulas. In Problem 5 
we shall give an elementary proof of the fact that if w, < we and 


Į 
(12.68) Palono) = y 5 “com 
stoVa sks yn 
then 
(12.681) lim Palonen) = a) Í * an dx. 
n— o T Jar 


This and related formulas are very important in probability and statistics. 
Persons who have or will have interest in these matters are well advised 
to complete this course and proceed to study an authoritative textbook 
by Fellerf which many people read just for the fun of it. 


Problems 12.69 
1 Show that putting p = 0, g = n, and f(x) = x? in (12.662) gives the 


formula 
) pa n(n + 1)(2n +1) 
k=l 6 
2 Supposing that s is a positive integer, show that putting p = 0, g = n, 
m = s, and f(x) = xin (12.663) gives the formula 


ae TA 
2 kassitit de DE- CO j+ Damn 


Remark: The result can be put in the neater form 


Š ht = ni +r $ ($ H) ams, 


t William Feller, “An Introduction to Probability Theory and its Applications,” John 
Wiley & Sons, Inc., New York, 1957. 
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involving binomial coefficients. For example, putting s = 3 gives 


Yo kè = n? + nC) + 4n%(—4) + 60°C) + 4n (0), 
k=l 

so 

ype = Bn + I)? 

2 =| 


k=l 
Putting s = 4 gives 


Yi ki = nt + Hnd + Sn(—4) + 10n9(8) + 107200) + Sa(—3)], 
k=l 
SO 


< i= n(6n4 + 15n3 + 10n? — 1) 
Lr 


_ n(n + L(a + 1) (3n? + 3n — 1). 
E 30 


Some people spend huge amounts of time working out these formulas by other 
methods. 


3 Show that setting p = 1, g = n, f(x) = x7}, m = 3, and Cı = y in (12.665) 


gives the formula 


a1 1 1 r= B(x) 
0 Xgrbenty tag ime tof, e 


- 


dx, 


where y is the Euler constant. Show that setting p = 1, g = 10, f(x) = x7}, 
and C, = y in (12.666) gives the formula 


1 “G IB o m! 
= ) -— log 10 — ay — = | — Bn(x) dx. 
(2) » k 88 + 1=2 10? j 10 xt (x) dx 
Remark: Knowing that 
log 10 = 2.30258 50929 94046 ---, 


it is not difficult to push a pencil through the calculations by which (2) is used 
with m = 10 to obtain the first 10 decimal places in 


y = 0.57721 56649 01532 86061 


When y is known, (1) enables us to make very close estimates of the sum of 
10 or more terms of the harmonic series. 


4 Supposing that z > —1, show that putting p = 1, g = n, and f(x) = 
log (z + x) in (12.663) gives, when m = 2, 


(1) 2 log Œ+ E) = (+n +3) log (z+ n) — (+4) log (g@+1)—n+1 


B fp 1 dy po mn = 1B) 9, 
PREET cael t J, Gta 
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Remark: We proceed to show how this formula can be used to derive 1 ery impor- 
tant formulas involving factorials. Putting z = 0 in (1) gives 


(2) log n! = (n+ 4) logn—n+1 
B, 1 n — 1)!Bn 
| -1]+ f (m — 1)!B (*) gy 


g=2 (j — Lg ban x™ 


One of the truly great mathematical discoveries is the fact that (2) can be 
improved with the aid of the formula 


_ (2a)! V nT _ s. 
©) an Gy T 


see Problems 9.59, Problem 6, equation (13). From (3) we obtain 


(4) lim {log (2n)! + log ~/nm — 2n log 2 — 2 log n!} = 0. 
n— œ 


In this formula we substitute the expression for log n! given in (2) and the expres- 
sion for log (2n)! obtained by replacing n by 2n in (2). The result should not 
overwhelm us, because we can overwhelm it. Many of the terms cancel out, the 
remaining ones have limits, and (4) reduces to 


an © _ B, ff? (m—1)!Ba(x) , 
(5) log da 1 +) (j — 1)j 1 ym -7 dx = 0. 


Since 


n (m — 1)!Bm(x) dx — > (m — 1)!Bm(x) de 


o [e= Te 
= (m —1)'Ba@) , _ o (m — 1)!Bu(x) 
=- fj a dx = — o e F ny F ny" dx, 


adding the left side of (5) to the right side of (2) gives the remarkable Stirling 
formula 


(7) logn! = log VIr + (n +4) logn — n 
©“ B, 1 = (m — 1)!Bn() 3 


z=2 (J — lpn Jo (m+ x)™ 


To produce a Stirling formula applying to factorials of noninteger numbers, we 
suppose that z > —1 and use the definition 


g tei n\n? l 
8) mm e GF Ie +2) @ EH) 


which appeared in Section 3.3, Problem 11. This is equivalent to 


(9) log z! = lim [log n! + zlogn — 5 log (z + &)]. 
k=l 


n> œ 
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In this formula we substitute the expression for log n! given in (7) and the expres- 
sion for the last sum given in (1). With the aid of the fact that 


(10) lim (z +2 + $)log (z + n) ~ log n] = z, 


we find that 


(11) log z! = log V 2r + (z + $) log (z + 1) — (z +1) 
B; 1 © (m — 1)!B,, 
+ `, - 1 a f (mn — 1)'!Ba(x) 
jo2 (J — j @ +1) 1 (z + x)” 
In case z > 0, we can add log (z + 1) to both sides of (11) and then replace z 
by z — 1 to obtain the alternative formula 


dx. 


(12) log z! = log Vr + (z +3) log z — z 


© B; 1 _ f> (om — 1)!Balx) 
+ -i h (z + x)” a, 


j=2 (J — 1)j zi 

which reduces to (7) when z = n. The derivation of Stirling formulas is not 
yet complete. To finish the task, we must study the theory of analytic functions 
of a complex variable. It will then be possible to observe that the members of 
(12) are analytic over the set consisting of complex numbers which are neither 
O nor negative. The principle of analytic extension then implies that the mem- 
bers of (12) are equal for each z which is neither 0 nor negative. We conclude 
with some remarks about (12). Let 


We can then put (12) in the forms 

(14) log z! = log V2zr + log z7 — z + E(z) 
and 

(15) zl = ~/ 2am eee, 


The formulas (12) and (14) are Stirling formulas for log z!, and (15) is the Stirling 
formula for z!. In many applications, E(z) is so near O that it can be neglected. 
Information about E(z) is obtained from (13). When n = 1, the first sum in the 
right member of (13) contains no terms. Hence putting m = 1 in (13) gives 


(16) E(2) = ~ N Aie) dx. 

Putting m = 3 and 5 and 7 in (13) gives 

(17) E() = = — Í, j ei dx 

0 ob- | Behe 

(19) EO =m- jam t nae fy Ce am 
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and more formulas can be produced by giving greater values to m. While these 
formulas have important applications to cases in which z is a complex number, 
we confine our attention to the case in which z is real and z > 0. When pisa 
positive integer, the function (z + x)~? is then positive and decreasing over the 
infinite interval x 20. Hence properties of the Bernoulli functions (those 
revealed in a problem at the end of Section 5.3) enable us to show that the integrals 
in (16), (17), (18), (19) are respectively negative, positive, negative, positive. 
Hence 


1 1 1 
E(z) > 0, E ——,) — — —__, 
(20) (z) (z) < Ik E(z) > in aoz 
and 
1 1 1 
9 -n a —— e 
(21) EQ) < 12z 360z? + 126025 


Even when zis as small as 2 or 3, this gives remarkably precise information about 


E(z). In very many applications of these things, z is a positive integer n and 
(15) is put in the form 


(22) n! = yV 2nr ne Reba! lan. 


where 0 < 6, < 1 and 6, is near 1 whenever n is large. In fact, putting E(z) = 
E(n) = 0,/12n in (20) and (21) shows that 


1 1 1 


Thus 6, is quite close to 1 even when n = 1. 
5 When nisa positive integer, we can put x = y = 1 in the binomial formula 


a ` n! akok — A SNN a 
(1) (« + y) -= È ma-i” r= BE kyk 


to obtain the formula 
A 1 /n 
(2) = (5) =} 
2, 27 \ k 


To obtain more information about the terms in this sum, particularly when k 
is roughly 2/2, we start with the formula 


(3) log z; (7) = —n log 2 + log n! — log k! — log (n — k)! 


The last three terms can be calculated from the Stirling formula 


(4) log n! = log V 2r + (n + $) logn — n + En 


and the results of replacing n by k and by (n — k) in it. The error term En, 
which can be approximated very closely with the aid of formula (13) of the 
preceding problem, is about 1/12n even when n is quite small. Except when n 
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and k are quite small, it is advisable to substitute in (3) and simplify the result 
before making numerical calculations. Thus we can put (3) in the form 


(5) log =. G)- log 4] 2-( + 5) log = 


— (n- b+ 5) log 2(1 = =) +n 


where En, = En — Es — En, Much progress in probability and statistics is 
based upon the idea that when & is near n/2, we can represent it in the form 


(6) k=3 +À Vn, 


where À is a number that depends upon k and n. When (6) holds, we find that 


and we can put (5) in the form 


© 13 (4) =e Z- (+a va +4) (1+ 2 

- (3-A va +3) ls (1 = Se) + n 
and hence 
(9) log =. (7) = log |= — LEL log (1 — 2N 


~ va [log (1 + >) — log (1 - > | + Ba 


Now let æ be a positive number and suppose that k is close enough to n/2 to 
make 


(10) z aVnsk=3+AvVns5+avn. 


Then |A| S æ and we will have 


(11) mM 2 | se <} 


provided n is sufficiently great, say n = mo. It can be shown that there is a 
constant Mı such that, when |x| < 4, 


(12) log (1 — x) = —x + Fi(x)|x|? 
(13) log (1 + x) — log (1 — x) = 2x + F.(x)|x]3, 


where [Fi(x)| S$ My and [Fe(x)| S My. These facts and (9) yield the conclusion 
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that there are constants M and D,,; such that [Duel S M and 


l fn 2 D 
log — ; = 10 — — -+ nk, 
(14) og Fn ( ) l g J 22 


This shows that if (10) and (11) hold, then 


(15) eM In I~ —2)2 <> 7 (7 ) s < eMn Ka eTA, 


Suppose now that xı and x, are two numbers, not necessarily positive, for which 
xı < x2 and suppose that a has been chosen such that |x| < œ and |x| < a 
Let 


(16) P(x1,x2) = > a 9) 
stan Sk SStawvn 


Since lim e~¥/" = I and lim e¥/" = 1, it follows from (15) that 
n> © n 00 


(17) lim Palxix2) = lim $” q- enn? 


provided the limit on the right exists. In (17) and elsewhere, a star on a sigma 
indicates that the range of summation is the same as that in (16). For present 
purposes, let the number A in (6), (15), and (17) be denoted by Ay, so that 


k = 5+ da V2. 


Then An, — Anni = 1/ Vn, and consequently 


(18) y A = en = y NE eax (Nak — An,k—1)- 


The right member of (18) is, except for negligible discrepancies at the ends of the 
interval xı S x S x2, a Riemann sum which converges to the right member of 
the formula 


(19) lim y 2 enk = Re Í ME gz dy, 
n= © nT TN Jı 


Therefore, (19) holds. From (19), (17), and (16) we obtain the formula 


° 1 n _ 2 T? —222 d 
@ m o aaea 


z trn Sk Soter/n 


= =2 


In order to compare this result with other statistical results, we replace xı and 
x: by wı/2 and we/2 in (20). Changing the variable of integration by setting 
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x = t/2 then gives the formula 


(21) lim X 


noe n 1 nr , 2 
str PER 


° e-?/2 dt, 


(<del. 
k 2m Jow 
The right side of (21) can be evaluated with the aid of tables giving values of 


1 Ww 
— = [ e7Pl2 dt 
VJ 2x J0 
for various values of w. 
6 Our educations are not quite complete until we have seen the formula by 
which the power series expansion of tan x is expressed in terms of Bernoullj 


numbers Letting Bo(x), Bi(x), - - +- be the Bernoulli functions, we start by 
deriving the formula 
tet ` 
(1) Ta = dy Bale 
n=0 


which holds when 0 < x <1 and O < |:| < 2r. To simplify matters, we can 
suppose at first that 0 < |z| <1. When y(x) denotes the right side of (1), we 
can differentiate termwise to obtain 


(2) ye) = È BiG = D Basle 

= PA Brt = t X B,(x)i® = ty(x). 
Therefore, 
(3) £ piya] = ey") — @)] = 0, 


and it follows that for each z there is a constant c(t) for which 


(4) ezty(t) = c(t) or y(t) = c(t)e**. 
Therefore, 
(5) cHe = ) BP. 


Integrating (5) over the interval 0 < x < 1 and using (12.611) and (12.613) give 
the first and then the second of the formulas 


e—1 t 
(6) c(t) = 1, c(t) = TIT 
Substituting in (5) then gives (1). Since By(x) = x — $ when 0 < x < 1, we 
can subtract (x — 3) from both sides of (1) to obtain the formula 


é + Det —1 — Ix(et — » 
(7) see) = D Ba (Jt 


n =0 
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in which the star on the 2 means that the term for which n = 1 is omitted from 
the series. More advanced mathematics contains theorems which allow us to 
take termwise limits, as x approaches zero through positive values, of the mem- 


bers of (7). This, the fact that 
(8) lim Ba(x) = B,(0) = Bn 
x—>0 n! 


when n Æ 1, and the fact that B, = 0 when n ıs odd and n = 1, give the formula 


teetl | Be B 
(9) Sap Bot et et ey -> atk 42k 


which is valid when 0 < |:| < 2r. Putting ¢ = 2z in (9) gives the formula 


ai a+ et -È 2B ox o 


(10) TH! zk 


Z =— gz 


which is valid when 0 < |z| < r. The theory of functions of a complex variable 
provides reasons why (10) is valid when z is a complex number for which 0 < 
|z] < T, and we can put z = 76 to obtain 


gf ete =s 22k B 
(11) ið -X gpi CO” 
when 0 < |6| < m. Since i = (i?) = (—1)*, use of the Euler formulas given 
in Problem 20 of Problems 12.49 enables us to put (11) in the form 


p 2 * Bx 


py 02k 


(12) 0 cot 6 = ) (—1) 
k= 


when 0 < |0| < m. Having established (12), we can show that 
(13) 6 tan 0 = 6 cot 0 — 20 cot 26 
when 0 < [0| < 2/2 and use (12) to obtain our final formula 


< 22k (2% — 1) Bo 
14 tan 6 = 1) = 92-1 
(14) » (-1) On! 
which is valid when |0| < 2/2. The above formulas and modifications of them 
appear in quite elementary tables, but we must always be prepared to observe 
that some brief treatments of Bernoulli numbers use B, to denote the number 


Bo,(0)/(2k)! which we have called By. 


Iterated and 
13 multiple integrals 


13.1 Iterated integrals When we differentiate a function f having 
values f(x) and then iterate (or repeat) the process, we obtain functions 
having values denoted by f'(x), f” (x), f” (=), . When we iterate 
the process of integrating a function f over an interval from a to x, we 
need more elaborate notation to express the results in terms of the given 
function. ‘To begin operations, we suppose that 


(13.11) fi) = [FO ae, fale) = [PAO a, 
fale) = [fold d, 


To express fz in terms of f, we can avoid snarls of various kinds by replac- 
ing ¢ by tı and then x by t to obtain 


fat) = fi FE) dt. 


631 
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Substituting this in the formula for f(x) then gives 


(13.12) fo(x) = J (Site) dt,) dt. 


Replacing tı by #2 and then ż by tı and x by t gives 
t ti 
f(t) = f (f. f (te) di) dt, 


and substituting this into the formula for f3(x) gives 


(13.13) fale) = fa fe (SEE) dt) dts} d. 


The integrals in (13.12) and (13.13) are examples of iterated integrals 
and, for each n, we could write a formula for fa(x) which involves 7% of 
these integrals. 

In case f(x) = 1 for each x, we do not need the iterated integrals to 
obtain formulas for fı(x), fa(x), - - - ; we can use the formulas (13.11) 
one after another to obtain 


(13.14) fil) =*x-—a4, fax) = sa falx) = ee 
faz) = EO, 


We must, however, learn how iterated integrals are manipulated when 
they appear in our work and cannot be avoided. The way in which 
(13.12) was obtained tells us that to get f(x) we should integrate first 
with respect to #; to evaluate the integral in parentheses to obtain a 
function of t which is integrated with respect to ¢ to obtain fe(x). A 
tempest in a teapot appears when we, like everyone else, adopt the view 
that parentheses are nuisances and write (13.12) in the form 


(13.15) fale) = fE fi F) dn a 


and insist that we must find f(x) by integrating first with respect to t, 
the limits of integration being a and t, and integrating last with respect 
tot from a to x. The difficulty lies in the fact that there is always the 
possibility of constructing a theory of iterated integrals in such a way 
that, for example, 


(13.16) JE [o Fen) dx ay 


means the result of integrating first with respect to y from C to D (not 
from Æ to B) and then integrating last with respect to x from Æ to B. 
It is equally sensible to insist on one hand that we should “work outward 


D B . 
from the middle,” so that f goes with dx and f; goes with dy, and to 
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insist on the other hand that we should “work from right to left,” so 
D B 
that Í ç goes with dy and J 4 goes with dx. Of course, we could lengthen 


a long story by insisting that we should always keep the parentheses and 
avoid the tempests and the stories, but this is impractical. While we 
reserve the option of using parentheses whenever we wish to do so, we 
ordinarily remove parentheses and ambiguities from iterated integrals 
by writing the integrals in such a way that each integral sign except the 
one on the right is immediately followed by the symbol showing the 
variable of integration which has the limits of integration appearing on 
the integral sign. Thus, for example, 


2 4 2 y3 y=4 
(13.17) faf nyt dy = | ax| =| 
1 3 1 3 
2 


1 
and 
x t x t— 2 just 
(13.181) I at | ( — u) du = i d| - es 
0 0 0 u=0 
1 z 1 {3 jé==x x3 
= = {2 dt =e = —. 
rf a5] 5 
Note that, in each case, the integral appearing on the right is evaluated 
first. Note also that when we are in the process of integrating with 
respect to a particular variable, all other variables are temporarily con- 


sidered to be constants. Opportunities to become familiar with these 
things are provided by the following problems. 


Problems 13.19 
1 Show that 
1 Ze, > 4 1 te a _ 1 
(a) fp df (xt + y2) dy = 4 OA at fC +u) du = —— 
, (n > —1) 
(c) f dx [dy [de = 4 (d) h d [PO edy a1 


2 By evaluating all of the integrals involved, show that 


1 2 2 1 1 2 2 1 
(a) [de fy x dy = Pay [Ox ae ©) h ae [Peay = f? dy f xd 
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3 By evaluating all of the integrals involved, show that 
t z t 
I dx f f(y) dy = iA t — yY) dy 
when 
(a) p> —l and fy) =y 


W) k>O and fly) =e 
(c)o# 0 and f(y) = sin wy 


I d t t 
o“ v= wl — fy 2 


which abbreviates the formula 


4 The formula 


ih u(x)o’ (x) dx = uol) | _ h o(x) u! (x) dx 


for integration by parts, has unexpected applications. Assuming that f is con- 
tinuous and 


I= fy dæ fy FO) d 


find the result of integrating by parts with 
T 
u= hody @) = 1 


u'(x) = f(x), o(x) = — (t — x). 
5 Calculate the two integrals J and J defined by 


I= h d k d _ k d f? d 
~ Jo x 0 f(x,y) Ys J — Jo Y Jy fey) X 
and show that they are equal when 


(a) p> —l,g>—1 and = f(x,y) = xy 
(b) f(xsy) = et 


6 Show that, when n > —1, 


1 1 J Qnt2 — 2 
J, ax fa ( + y)” dy = (n + 1)(n + 2) 
7 Show that 
1 1 1 
— dy = 2] . 
Í dx f P 2 log 2 
8 Show that 


li T 1 d 
= = ©, 
J, ECETIA 


9 Supposing that 0 < p < 2and p #1, show that 


1 pfi 1l 22 — 2 
J, ax J, GFP? = C-l- 2) 
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10 Supposing that n > —2 and n =Æ 1, show that 


1 x Qatl — Í 
J de f EEE DEED 
Investigate the case in which n = —1. 
11 Show that 
1 2z 3 1 1 
Ya _? Y h= 

(a) fo ax f ? dy a (0) f æf Za 20 
(©) f, a fo"? af ax [Fa =] 


12 Show that making one integration gives the formula 


ae {7 dy =2 f 41 l 4 
2h |, T5? =? h Ira 


which, so far as we know, has dubious validity because the last integrand is 
meaningless when x = 0 and when x = 1. Show that 


z ] 
eats 


has the value 0 when x = 0 and does not exist (or has the value +) when 
= 1. Then proceed to the next problem. 
13 The integrals in 


1 x 1 
(1) 2f ae iets 


cannot exist as iterated Riemann integrals because the integral in 


(2) 1 = [Se 


does not exist as a Riemann integral when x = 1. However, when 0 S$ x < 1, 


(3) fe) = [Fd + ay Hay? tat t+) dy 
=|[yt + peer te 


xe x8 x 
= x — — — s . 
+ 7 + 3 + 7 + 
lo each positive integer n there corresponds a positive number ô such that 


DE 


(4) Izr t +5 


when 1 — ô < x < 1 (why?) and hence (why?) 


(5) lim f(x) = 
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The integral in (1) will exist as a Riemann-Cauchy integral and will have the 
value V if 


(6) 4 


[AES ees a 


r+ 1— 


: r? rt r r 
lim (Btitatő+ .. ‘). 


Ii 


r>1i— 


With the aid of the basic fact that Z(1/n?) = 12/6, prove that V = a?/6. Hint: 
Supposing that 0 < 7 < 1, let the last series in (6) converge to g(r) and begin 


Nn 
by showing that g(r) < 1?/6. Then compare g(r) with y 1/k? — ¢/2 
k=1 
14 Prove that the formula 


f dx fi Kew) dy = f: dx f g(x,y) dy + f: dx ik h(x,y) dy 


is valid provided (i) f(x,y) = g(x,y) + h(x,y) when aSxSbandeSy Sd 
and (ii) the integrals in the right member exist. Hint: Let 


F(x) = Mic dy, G(x) = J, ° g(x,y) dy,  H(x) = I, * hly) dy. 


and use known facts about simple integrals. 


15 Let 
(1) F(x,y,2) = P(x,y,z)i + Q(x,y,2)j + R(x,y,z)k, 


where the functions are continuous over some spherical ball B with center at 
(x0,v0,20), be the force on a particle when the particle is at the point (x,y,z). 
Show that the work W,(x,y,z) done by the force in moving the particle along line 
segments from (%o,yo,%0) tO (%0,7,%0) and then to (xo,¥,z) and finally to the point 
(x,y,z) in the ball is 


O) Wily) = f? Olobi) dB + [* Rew) dy + f* Playz) da 


and that 


OW 
(3) =z = P(x,4,2). 


Show that the work W2(x,y,z) done by the force in moving the particle along 
line segments from (%0,'y0,%0) to (x,Vo,20) and then to (x,y,z) and then to (x,y,z) is 


O Waya) = [7 Plerute) da+ [Revo dy + fy OBa) 48 


and that 


6) oF = Oley): 


Show that the work W3(x,y,z) done by the force in moving the particle along line 
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segments from (xo,yo,Z0) to (%*,¥o,%0) and then to (x,¥,20) and then to (x,y,z) is 


(6) Waley) = f 7 P(a,yo,t0) da + f ” 0(x,8,20) dB + f. R(x, y) dy 
Zo yo 20 
and that 


OW. 
(7) Oz = = R(x,4,2). 


Remark: A force field is called conservatise if the work done in moving a particle 
around a closed curve is zero or (what amounts to the same thing) if the work 
done in moving the particle from one point to another is the same for all paths 
running from the first point to the second. If F is conservative, the functions 
in (2), (4), and (6) are equal and we may set 


(8) W (x,y,z) = Wi(x,y,z) = Wa(x,y,2) = Ws(x,y,2). 
Then (3), (5), and (7) give 


OW OW OW 
0) =a L= gy R= ae 
SO 
OW 
(10) Fait it ko VW 


and F is the gradient of W. If in addition, the particle being moved is a unit 
mass in a gravitational field or a unit charge in an electric field, then W is called 
a potential function, the potential at the point (x,y,z) being W (x,y,z) or W (x,y,z) + 
C, where C is a constant. It is sometimes important to know that if P, Q, R are 
the scalar components of a conservative vector function and if they have con- 
tinuous partial derivatives of first order, then (9) implies that 


(11) ðQ OP OR OP OR ðQ 


es oe eee -me oee eee — m = $ 


Ox ay’ Ox ðr dy  ðz 


More information about this matter appears in Problem 10 of Section 13.3. 
16 Assuming that all of the integrals exist, tell why 


oze) a f(x,y) 
foe «fro 4 yf F 1(«)Fe(x,y)Fs(x,y,2) dz 


(x,y) 
b > 
l [ty an [rena ay [Ps 
1( 
17 Calculate 


ð? z y 
z5 l, ds f u(s,t) dt 


for the case in which 


(a) p Z 0, 4 20, u(s,t) = sri8 
(b) u(s,t) =s+t 
(c) u(s,t) = e sin t 
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Ans.: In each case the answer is u(x,y), 
stand why this should be so. 
18 Try to understand the formulas 


and it may be worthwhile to try to under- 


Í " uy(%,t) dt = u(x,t) P- = u(x,y) — u(x,b) 


and 


f ds IN Uz, y(s,t) dt = f: ds [eo 


f: | wels9) — u (5,8) | ds = [utes — u(s,b) Jz 
u(x,y) — u(a,y) — u(x,b) + u(a,b). 


li 


13.2 Iterated integrals and volumes In the study of iterated and 
multiple integrals and their applications, we continually need unin- 
herited skills and information that can be efficiently acquired by making 
a calm and thorough examination of matters relating to Figure 13.21. 


Figure 13.21 


We start with the idea that the graphs of the two equations y = «/2 and 
y = x?/4 intersect at the points (0,0) and (2,1). These graphs separate 
the closed rectangular region R, consisting of points (x,y) for which 
OS*52,0 <y S1, into three subsets S1, S2, S}. While it makes no 
difference how disputes over ownership of boundaries are resolved, we 
want them resolved in some way and we suppose that S contains each 
of its boundary points. Thus S% is the set of points (x,y) for which 0 < 
x S 2 and x?/4 S y S$ x/2. Then S; is the set of points (x,y) for which 
0 <x S2 and0 <£ y < x*/4, and S; is the set of points (x,y) for which 
OS“%<2andx/2<y1. 

While more recondite modifications of the construction are easily made, 
we keep our example simple by supposing that 4, B, C are three constants, 
that f(x,y) = A when (x,y) is a point of Sı, that f(x,y) = B when (x,y) 
is a point of Se, and that f(x,y) = C when (x,y) is a point of S3. Subject 
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to existence of the integrals involved, we investigate the number / 


defined by 


(13.22) I= h dx h fey) dy. 


Our first step is to recognize that, for each fixed x in the interval 0 < 
x <2, the integrand in the integral 


(13.221) h fey) ay 


has values that depend upon y. To make effective use of our informa- 
tion, we mark a point x on the x axis between 0 and 2 and then draw a 
line through this point parallel to the y axis. A part of this line is in 
Sı where f(x,y) = 4, another part is in S where f(x,y = B, a third 
part is in Są where f(x,y) = C, and, moreover, the end points of these 
parts depend upon the fixed x. We must understand our situation so 
thoroughly that we see that 


f(xy) = A when 0 <S y < x*/4, 
f(x,y) = B when x?/4 S y <S x/2, 
f(x,y) = C when x/2 < y S1, 
and that 


1 22/4 z/2 1 
(13.222) f flay) dy = [Adyt [Bay t+ [Cay 


SO 


a23 [ienaa +3(F-F) +e (1-3) 


Substituting this in (13.22) gives 


2 x? x x x 


and hence 
(13.225) Il=4A+4B+C. 


Supposing that 4, B, C are nonnegative constants, we proceed to 
show that the number J is the volume of the solid block H which stands 
upon the rectangular base R of Figure 13.21 and has, at each point 
(x,y) of R, height f(x,y). It is quite possible to imagine that the rec- 
tangular set R of Figure 13.21 lies in a horizontal plane beneath our eyes 
and that the block H has its base on R and extends upward toward our 
eyes. If the operation is helpful, we should imagine that we are in an 
airplane and are looking down upon a hotel built upon the set or site R; 
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the parts covering Sı, Se, Sz have heights 4, B, C. We should know 
that we can undertake to start with an x, Vy, % coordinate system oriented 


in the usual way and to sketch a figure like Figure 13.23 showing the 
block (or hotel) H. 


Figure 13.23 Figure 13.24 


To find the volume of the block H, we do not need a figure in which an 
architect could take pride. It is sufficient to use the slab method which 
was candidly presented and employed in Section 4.5. We make a parti- 
tion P of the interval 0 S x < 2 into subintervals of lengths Ax, Axe, 

, Ax, and let xf be a point in the th subinterval so that x-1 S 


xf S xr. For each k, the number 


(13.241) h F(=ky) dy 


is the area of the intersection of the plane x = x* and the body H. 
Depending upon the choice of x*, the number 


(13.242) Axi h F(xky) dy 


is exactly equal to or is an approximation to the volume of the slab of 
H between the planes x = x,_, and x = x, The sum in the formula 


di l 
(13.243) V= lim X Am f flety) dy 
k=l 


is then exactly equal to the volume V of H or is an approximation to V, 
and in any case the limit as |P| — 0 is X. Thus 


(13.244) V=I= N dx h f(x,y) dy 
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and we have shown that, in the special cases being considered, our 
iterated integral is the volume of the block H. 

Our work involving Figure 13.21 is only half finished because we must 
investigate the integral J defined by 


(13.25) J= h dy f, Ee) dx 


and must assimilate some new ideas. For each y in the interval 0 < 
y <1, the integrand in the integral 


(13.251) h f(xy) dx 


has values that depend upon x. As in Figure 13.21 we mark a point y 
on the y axis between 0 and l and draw a line through this point parallel 
to the x axis. A part of this line is in S3 where f(x,y) = C, another part 
is in S2 where f(x,y) = B, a third part is in Sı where f(x,y) = 4, and, 
moreover, the end points of these parts depend upon the fixed y. We 
must examine Figure 13.21 carefully enough to see that 


f(x,y) = C when 0 S x < 2y, 

f(x,y) = B when 2y S x < 2 Vy, 

f(x,y) = 4 when 2 Vy <x S 2, 
and that 


(13.252) ik fly) dx = Í, YO dx + ine B dx + A A dx, 


SO 
(13.253) ff æy) dx = 24(1 — Vy) + 2B(Vy — y) + 20y. 


Substituting this in (13.25) gives 


J=2 h (Ad — V5) + By — y) + Cyl dy 


and hence 
(13.26) J =#A+ 4B + C. 


Comparison of (13.225) and (13.26) shows that J and J are equal. 
When 4, B, C are nonnegative, J is the volume of the block H and it 
follows that J must also be this volume. We need the experience gained 
by using the slab method to prove that J, like J, is the volume of H so we 
will have another and more informative proof that J = J. Let P bea 
partition of the interval 0 S$ y < 1 into subintervals of lengths Ayu, 
Ays, © + * , Ayn and let yf be a point in the kth subinterval such that 
Ye~1 S Ye S yx. While we should acquire the ability to do our chores 
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without benefit of elegant figures, we 
can use Figure 13.27 to help us see 
what we are doing. 

For each &, the number 


(13.271) fy fleyt) de 


is the area of the intersection of the 
plane y = yg and the body H. De- 
pending upon the choice of y¥, the 
number 


(13.272) Ay: E dx 


is exactly equal to or is an approxima- 
tion to the volume of the slab of H between the planes y = y,_1 and 
y = yx The sum in the formula 


Figure 13.27 


(13.273) V = lim `, Ay h f(x,y) dx 
k=l 


is then exactly equal to the volume V of H or is an approximation to /, 
and in any case the limit as |P| — 0 is VY. Thus 


(13.274) V=]= h dy h F(x,y) dx. 


For the special case being considered, equality of the last two members 
of the formula 


13.275) F=f de f f(x,y) dy = fi dy fy Key) dx 


is a consequence of the fact that each member is equal to the volume V 
of a solid body H. 

Some special applications of these ideas and formulas are particularly 
worthy of notice. In case 4 = C = 0 and B > 0, the number F is the 
volume of a body which rests upon the base S and (13.275) reduces to 
the formula 


2 2/2 1 24 
13.276) V= f d fia flay) dy = h d fa Key) ax. 


In case 4 = B =C = 1, the solid H has unit height over the whole 
rectangular set R and the volume of H must be the same as the area |R| 
of R. We could therefore be sure that there would be a mistake in our 
work if it were not true that the numbers / and J in (13.225) and (13.26) 
reduce to 2 when 4 = B=C=1. Incase 4 = C = 0 and B = 1, the 
numbers X, J, and J reduce to the area |S.| of the set S2. 

Among other things, the above example leads us to the following idea. 
Suppose we want to find the volume V of a solid body K which rests upon 
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a set S which is a subset of the rectangular set R containing points (x,y) 
for which aS x Sb, c S y Sd, the height of K at each point (x,y) 
in S being f(x,y). We can undertake to solve the problem in the fol- 
lowing way. Let f be extended in such a way that f(x,y) = 0 when 
(x,y) is a point not in the set S. Then, unless the set S and the function 
f are much more tortuous than those appearing in elementary nonpatho- 


logical problems, two applications of the slab method for finding the 
volume of Y yield the formula 


(13.28) v= [i dx [fey dy f, "dy f, ” fey) dx. 


It is very important to be aware that, in all ordinary and many extra- 
ordinary circumstances, the last two members of (13.28) are equal even 
when f is a discontinuous function for which f(x,y) = 0 when the point 
(x,y) is not in a particular set S in which we are interested. More 
information about this matter will appear later. 


Problems 13.29 
1 Let 


40 40 40 40 
r= fot f atend T= fag dy [gq flee) ae 


where f is continuous over the region R bounded by the graphs of the lines having 


the equations y = —l, y = x, and x = 1 and f(x,y) = O when the point (x,y) 
is notin R. Show that 


1 z 1 1 
r= fia | tena, T= fld f) Key) ax. 
Evaluate J and J and show that they are equal in case 


(a) f(x,y) = 1 (b) f(x,y) = x (c) f(x,y) = y 
(d) fy) =x +y (e) f(x,y) = xy (f) f(x,y) = x + y? 


when (x,y) is in R and f(x,y) = 0 when (x,y) is not in R. 
2 For each of the formulas 


(a) [ae f teey) dy = ff ae fE Key) ay 
o [ay [toa ax fy ay [i iEn) ax 
(0) f ae f° fey) dy = hi de [Z ies) ay 
(a) f° ae f° sony) ay = [ae JY? hes ay 
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find a region R such that the formula is valid whenever f(x,y) is continuous over 
R and f(x,y) = 0 when the point (x,y) is not in R. 
3 Find a region R such that the formula 


h dx S He) dy = I dy y f(xy) dx + I dy fin f(x,y) dx 


is valid when f is continuous over R. Evaluate all of the integrals and make the 
results agree when 


(a) f(x,y) =1 (b) f(x,y) =x (c) f(x,y) = y 
(d) f(xy) =x +y (e) f(x,y) = xy (f) fy) = x? + y? 


4 A particular solid body K can be described as the set in E} which rests 
upon the base S in the xy plane bounded by the plane graphs of the equations 
y = x/2 and y = x?/4 and has, at each point (x,y) in S, height x? + y?. The 
same body K can be described as the set in E which is bounded by the graphs 
(they are all surfaces) in Æ; of the equations 


y = x/2, y = x?/4, z = 0, z= x? +y 


This book tries to be too honest to pretend that it is easy to sketch a good figure 
showing the body K. The book does insist, however, that we should have picked 
up ideas enough to enable us to use iterated integrals in two diferent ways to 
find the volume |K| of K. Do it. Remark: The answers should agree with 
each other. Moreover, since the area of the base is $ and the height varies from 
0 to 5, the answers should be between 
0 and #. 

5 Asin Figure 13.291, let S be the 
closed set of points in the rectangle 
asxx2b,¢ Sy S dwhichis bounded 
below and above by the graphs of 
y = fi(x) and y = f(x) and which is 
bounded on the left and right by graphs 
of x = gi(fy) and x = ge(y) Let F a b z 
be a function which is continuous Figure 13.291 
over S and is such that F(x,y) = 0 
when (x,y) is a point not in S Tell why the first integral in the formula 


d S2(z) 
(1) [i PGs) a = fia Fo) ay 


exists and is equal to the second one when a £ x S b. Tell why the first integral 
in the formula 


g2(y) 


b 
2) f Fen dx = ff FEDE 
a gi(y) 
exists and is equal to the second one when ¢ Sy Sd Show that the formula 


(3) ih dx Í 2 Eley) dy = fi dy fi Fee) dx 
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reduces to 


b falz) d g2(y) 
(4) f. dx Í, A Fey) dy = Í, dy Í re Fey) dx, 


Finally, give geometric interpretations of the numbers in (4) when, for each point 
(x,y) in S, 


(a) F(x,y) = 1 
(b) F(x,y) = x? + y? 


6 By use of iterated integrals, find the volume V of the solid bounded by 
the three coordinate planes and the graph of the equation 


cn a žŽ> 
atpte=h 


Remark and ans.: The solid is a pyramid, and use of the fundamental fact that 
the volume of a pyramid is one-third of the product of its height and the area of 
its base gives V = $abc 

7 Even when details are efficiently managed, it is not a short task to find 
the volume V of the solid in the first octant bounded by the three coordinate 
planes and the graph of the equation 


xN y\4 z\*2 
6) +G) +6) = 
It is, however, worthwhile to try to manage details efficiently and to earn the 
satisfaction involved in showing that V = abc/90. It can be added that the 
world is wide enough to accommodate and even need persons who run amok or 
amuck and become strong enough to solve the problems obtained by replacing the 
exponent $ by $ and | and ¢. 
8 Prove that if the formula 


d 
(1) [pe [T UE) + Blay = [ay fi ey) + B] da 
is valid for some constant B, then 
d d 
(2) J. d [ Fea) dy = [ay [? fey) ae. 


Remark: In case f is a bounded function for which f(x,y) is sometimes positive 
and sometimes negative, we can choose B such that f(x,y) + Bis always positive. 
In appropriate circumstances, we can prove (2) by proving the first formula 
which involves only positive integrands. 


9 Let a and x be confined to an interval over which a given function f is 
continuous. Let fo(x) = f(x) and let 


D Ae fO O fO fhe = [Awa 


and so on so that, for each nm = 1, 2, 3, +- >, 


(2) fanla) = [E hO d. 
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Since 0! = 1, the formula 


(3) falx) = in —— f(t) dt 


is certainly correct when n = 1. Assuming that (3) is correct for a particular 


positive integer n, show that 


O fal) = [EO ry a 


and use (2) to obtain 


O fone) =f au [EO hey a 
Use (5) and Figure 13.292 to obtain 


x u 


(6) Jail) = in dt in a =I Z F(t du Figure 13.292 


and then use (6) to obtain the result of replacing n by n + 1 in (3). Since (3) 
is correct when n = 1, it must be correct when n = 2 and hence when n = 3 and 
hence when n = 4, and so on. 


10 This section should not leave the impression that our ideas about Riemann 


integrals can always be applied to Riemann-Cauchy integrals. It can happen 
that 


T= f def" sodas, T= f”, dy f? fee) ae 


both exist but have unequal values. For example, let 


f(x,y) = lwhenx 20,x-1l<y <x, 
f(x,y) = —lwhenx 20,4 <y <x 41, 
f(x,y) = 0 otherwise. 


Show that, in this case, J = O and J = 1. 


13.3 Double integrals Section 4.2 showed how we partition intervals 
into subintervals to form Riemann sums and how we use these sums to 
define Riemann integrals over one-dimensional intervals. Because the 
idea is important in both pure and applied mathematics, we must learn 
about the process by which a set S in a plane is partitioned into subsets 
in order to enable us to form Riemann sums and define Riemann integrals 
over S. To begin, let S be a set of points which may, for example, be 
the set of points inside and on a circle or an outer boundary curve such as 
that shown in Figure 13.31. The set S may be the set of points P which 
are neither outside the outer boundary nor inside the inner boundary of 
Figure 13.32. What we really require is that the set S have positive 
area and that the points of S lie inside or on some rectangle R so that the 
set S is bounded. In some applications the set S is regarded as a lamina 
(thin plate) or as a plane section of a three-dimensional solid in which 
we are interested. While we normally use coordinates (rectangular or 
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polar, for example) to determine a point P of S, we do not at present 
allow any one brand of coordinates to dominate our work. We suppose 
that we have a bounded function f defined over S and use the symbol 
f(P) to denote the value of f at P. For example, if S is a lamina, f(P) 
could be the density (mass per unit area) at P or the product of the density 
at P and the specific heat at P and the temperature at P. IfSisa lamina 
and we want to calculate its moment of inertia about a line L, f(P) could 
be the product of the density at P and the square of the distance from 
P toL. Itis often helpful to think of | f(P)| as being the height at P of a 
solid which stands upon the base S. 
The first step in our approach to a Riemann sum is to make a partition 
Q (the letter P has been preempted) of the set S into n subsets Sj, S», 
, Sa. As Figures 13.31 and 13.32 indicate, the result of partitioning 


Figure 13.31 Figure 13.32 


a set S in E» into smaller subsets is not as simple as the result of parti- 
tioning an interval in Æ, into subintervals. The only things we require 
of the sets Sı, So, © © © , Sa is that they be nonoverlapping, that their 
union be S, and that each of them have positive area. It turns out that 
the notational transition from Riemann sums to Riemann integrals will 


be facilitated by denoting the areas of the sets Sı, S2, © © © , Sa by the 
symbols AS;, AS, - © © , AS,. The meanings of our symbols should be 
emphasized. For each k = 1,2, -- > , n, the symbol AS; does not stand 
for a part of the set S; it stands for the area of a part of the set S. For 
each k = 1,2, --- , n, let P be a point in the set Sx. The number RS 
(Riemann sum) defined by 

(13.33) RS = 2 f(Px) AS: 


is then a Riemann sum formed for the function f and for the partition Q 
of S. 

In order to tell what we mean by the norm |Q| of the partition Q, it is 
necessary to introduce a simple geometrical concept. The diameter of a 
set is the least upper bound of distances between pairs of points of a set. 
The norm |Q| of the partition Q of S into subsets S1, So, - © - , Sa is the 
greatest of the diameters of the sets S1, S2, - + © , Sn. Weare now ready 
to define the Riemann integral of f over S, the definition being analogous 
to that involving (4.23). If there is a number J such that to each e > 0 
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there corresponds a 6 > 0 such that 


(13.34) | I — 2 fa AS; | <e 


whenever the sum is a Riemann sum formed for the function f and for a 
partition Q of S for which |Q| < 6, then f is said to be Riemann integrable 
over S and J is said to be the Riemann integral of f over S. This integral 
is usually denoted by the symbol 


(13.35) |, KP) as, 


which displays the function f and the symbol S that represents the set 
which was partitioned to obtain the approximating Riemann sums. The 
integral is called a double integral because the set S is two-dimensional, 
that is, a set in Æz having positive area. The two integral signs serve to 
remind us that S is two-dimensional, but sometimes one of them is 
omitted from the symbol. As was the case for simple (that is, one- 
dimensional) integrals, it is a convenience (and sometimes also a source 
of misunderstanding, confusion, and controversy) to drag in the nota- 
tion of limits and write 


(13.36) JJa 1P) a8 = tim X FP) aS, 
or 
(13.37) Jf; KP) dS = lim Y fP) as. 


When we are interested in problems in which a function f defined over 
a bounded set S is involved and rectangular coordinates are to be used, 
we can produce substantial simplifications of our work by letting R be a 
rectangular set which contains the set S and by extending the domain of 
f by putting f(x,y) = 0 when (x,y) is a point of R which is not in S. The 
following theorem then enables us to evaluate double integrals by evalu- 
ating iterated integrals. 

Theorem 13.38 If S is a subset of the rectangular region R consisting 
of points (x,y) for which a S x S b and c S y Sd, if f(x,y) = O when 
(x,y) is a point in R but not in S, and tf the four integrals 


(13.381) Z= ff, fey) 4S, I = [fa flay) dR, 
(13.382) I= f; ds f fley) dy, L= fi dy fy floss) dx 
all exist, then 

(13.383) heh=h=I, 


that is, the four integrals are all equal. 
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Proof of this theorem lies far beyond the scope of this course. Persons 
who continue study of mathematical analysis until theories of Lebesgue 
measure and Lebesgue integrals (including a theorem known as the Fubinj 
theorem) have been learned will find that validity of the theorem will be 
an easy consequence of fundamental relations between Riemann and 
Lebesgue integrals. For the present, we can be content with a hazy 
understanding of the fundamental fact that the double integrals J, and 
I will exist if f is bounded and the set D of discontinuities of f has area 
(two-dimensional Lebesgue measure) 0 and, moreover, the iterated 
integrals I and J4 will also exist if it is also true that each horizontal line 
and each vertical line intersects D in a set having length (one-dimensional 
Lebesgue measure) 0. So far as elementary applications to elementary 
problems are concerned, we can be sure that if the set S and the function f 
are bounded, then the double integrals in (13.381) and the iterated 
integrals in (13.382) must exist and must have the same value. 

Symbols used for iterated integrals were discussed in Section 13.1. 
In addition to the symbols used in this section for double integrals, 
those appearing in the formula 


(13.384) ff, flay) aS = ff, fey) dx dy = fy fey) de dy 


are sometimes used when rectangular coordinates are involved. 


Problems 13.39 
I Let S be the set in Ez bounded by the graphs of the equations 
y = x?, y=x-+2. 
Supposing that f is continuous over S and that 
J= ff, fev) 4s, 
sketch a figure which displays the set S and an appropriate rectangular region R 


and then write complete and intelligible descriptions of the steps involved in 
using Theorem 13.38 to obtain the formulas 


J= f? ae [ZT fey) ay 


J = ik dy [% f(x,y) dx + f af? f(x,y) dx. 


2 Supposing that O < a < b and that f is continuous, determine a set S 
in the xy plane such that 


f dy pe f(x,y) dx = J I, Fay) dS. 


and 
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Then show that 


a x b a 
Sferen aS = fy ae fi flew) dy + f} ae fË soy) ay 
a+b 
+ fo da [E AEn) ay. 
b z—6b 
3 Supposing that S is a bounded set having positive area, interpret and prove 


the statement 
[S| = | gids. 


Hint: Look at the definition of double integrals. 


4 Supposing that f is continuous over S, use Theorem 13.38 and the method 
of Problem 1 to obtain iterated integrals equal to 


|, 79) as 


when S is the set in Ez bounded by the graphs of the equations 


(a) y =x, y = me+b (mb > 0) 
(b) y =x, y = 2x, 4 = 1 


(c) y =O, y =x, y = 3x —2 
(d) y = x? y =x 
le) y =e y =0,x=0,x=l 


(f) y =x, y = sin x, x = 1/2 


5 Observe that sin (x + y) is continuous and nonnegative over the square 
S in the xy plane having opposite vertices at the points (0,0) and (r/2, x/2). 
Evaluate 
I sin (x + y) dx dy. 
Ans.: 2. 
6 Evaluate 


Lies 


when Q is the square having opposite vertices at the points (0,0) and (1,1). 
Ans.: 3 log 3 — 4 log 2. 
7 Supposing that 0 < a<band0 < p < gq, evaluate 


[leone 


when S is the region bounded bY the lines having the equations x = a, x = b, 
y = px, and y = qx. Ans: (er — 6?) (b? — a’). 

8 Supposing that S is the square having opposite vertices at the points (0,0) 
and (1,1), show that 


a) I. - ds dy = f: ax f dy = | ELTA ae 
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provided the integrals exist. With the aid of series and termwise integration 
(which can be justified), continue the work to obtain 


2) Jhs» 2 Bos 


Remark: Persons who study more advanced mathematical analysis may encounter 


the following elegant theorem. If S is a set in Ey, if Sı is a subset of S having 
measure Q, if 


(3) f(P) = (P) + ua(P) + uP) + °° 


when P is in S but not in Si, and if u,(P) 2 0 for each k when P is in S, then the 
formula 


(4) Í f(P) dS = [en dS + Í ua(P) dS + Í us(P)dS +--> 


is valid provided the integrals exist as Riemann integrals or Riemann-Cauchy 
integrals or Lebesgue integrals. 

9 Obtain the result (2) of the preceding problem by termwise integration 
(which can be justified) of the series in 


Pop =l Hay Hayt H ayt t o. 

10 This long problem is for persons who wish to become good mathematicians 
or physicists or engineers and who realize that the best ones start learning about 
important things while they are young and then continue to learn more. Make 
a large copy of Figure 13.291 showing the set S which is the set of points (x,y) 
for which a Sx < b and fi (x) S y S fe(x) and is also the set of points (x,y) 
for which c S y S dand g,(y) S x < ge(y). Let C be the curve consisting of the 


boundary of S traversed once in the positive direction. Let F andG be continuous 
over S and let 


qo I= 1 FEDAS, J= ff Gey) a8. 


Show that 


g2(y) b J2(z) 


We enter a gate to an important scientific garden when we suppose that P and Q 


are functions which are defined and have continuous first-order partial derivatives 
over S and put 


3) Fay) = 2 = 0.659), Gleg) = E = Pyle). 
Show that in this case 
(a T= [ay [EP Quy) dx = [* al Gey) [Ur 


= f° O20), 9) dy — f° O10), ») ay. 
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Be sure to realize that this is correct because, for each fixed y, Q(x,y) is a function 
of x whose derivative with respect to x is the integrand Q.(x,y). We must now 
look at our figure. As y increases from ¢ to d, the point with coordinates (g2(y), y) 
traverses, in the positive direction, the part Cı of C which lies to the right of S. 


It is a consequence of the definition of curve integrals given in Problem 15 of 
Section 7.2 that 


6) J. e0) 9) ay = f, OD) ds, 


the parameter now being y instead of t. The last term in (4) is more troublesome. 
As y increases from c to d, the point with coordinates (e1(y), y) traverses, in the 
negative direction, the part C2 of C which lies to the left of S. This reversal of 
direction introduces a change in sign so that 


6 - JE OEG) ») dy = f OG) ay. 


Show that combining these results gives the formula 


n 2s- fo% 


Tell why 
b 
(8) J=f d 


J2(zx) 


b y=f(x) 
hay Pils) dy = f° dx | Ps) | 


y==f1(z) 


= fi Ple fue) dx — [? P(e, fle) dx 


3P 
— I J oS = fa? dy. 


Note that combining (7) and (8) gives the formula 


(10) f J, ($£ — 3) dS = Í. (P dx + Q dy). 


This formula appeared in the works of George Green (1793-1841), a pioneer in 
applied mathematics who originated the term potential function, and the formula 
is called a (or, sometimes, the), Green formula. 

The Green formula and its extensions have very important applications, some 
of which involve vectors. To be broad-minded about the matter, let 


(11) V(x,y,2) = P(x,y,2)i + O(x,9,2)§ + R@y,2)k, 


where V is a vector function having scalar components P, Q, R that are con- 
tinuous and have continuous partial derivatives over a part of Es in which our 
sets and curves are supposed to lie. The curl of the vector function V is a vector 
function which is written V X V, which is read “‘the curl of V” or “‘del cross V” 
and is defined by the formula 


and 


(12) VxVe= 


Plo 
O PS | D e 
7 P| o 
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in which the right member is a determinant. The expanded form of (12) is 


a3)  vxv=(5- 2) i +($- i+ (32 -5) k. 


For the special case to which (10) applies, S is a patch of surface in the xy plane 
and k is the unit normal to S which lies in the direction of the thumb on a right 
hand when the fingers point in the direction in which C is oriented. If we denote 
this unit normal by n, then in the special case (13) reduces to 


(14) (Vx V)-n = oe — 5. 


Therefore, the left member of (10) is, in the special case, the left member of the 
formula 


(15) J I, (Y x V)-n dS = AZ 


Our next step is to show that the right of (10) is, in the special case, the right 
member of (15). This is quite easy. Let functions x(2), y(t), z(¢) be such that 
the point P(t) having coordinates x(t), y(¢), 2(@) traverses C once in the positive 
direction as ¢ increases from ?f; to tz and let 


(16) rę) = «()i + y®j + 2@)k. 


For the case in which these functions have piecewise continuous derivatives, dif- 
ferentiation gives 


(17) r(t) = x(t) + y'i + 2’ Ok. 
Then 
(18) Ver’) = Px’) + Qy (6) + Rz (t), 


where V stands for V(x(), y(t), z2(t)), P stands for P(x(t), y(t), z(ż)), and so on. 
Hence 


(19) [ ” V-r'(t) dt = Í “ [Px'(t) + Oy'(t) + Re (0) di 


and these integrals are, by definitions of curve integrals, respectively equal to 
those in the formula 


(20) I V-dr = h (P dx + Q dy + Raz). 


In the special cases where C lies in the xy plane, we have z(t) = O for each ż, so 
the right member of (10) is equal to the right members of (19) and (20) and hence 


is equal to the left member of (20), which is the right member of (15). 
The formula (15), that is, 


(21) f J (V x V)}-n dS = AZ 


which reduces to the Green formula (10) when S and C lie in the xy plane, is 
known as the Stokes formula. So far, (21) has been proved only when S is a 
set in the xy plane which satisfies the heavy restriction given at the beginning 
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of this problem. One example is sufficient to expose an interesting idea that 
enables us to establish (21) for less simple sets S in the xy plane. To establish 


(21) for the set S of Figure 13.391, we split S into two subsets S, and Sz as in 


Figure 13.391 Figure 13.392 


Figure 13.392. Writing the formulas obtained by applying (21) to the simpler 
sets Sı and Sz and adding the results shows that (21) is valid for S because the 
curve integrals over the common boundary of S; and S> come with opposite 
signs and cancel out of the sum. After having proved that (21) is valid for sets 
of particular types that lie in the xy plane, the next step is to recognize that, when 
C is a suitable plane curve, the right side of (21) has an intrinsic meaning which 
is independent of coordinate systems. As can be suspected, this fact can be 
used to show that the left member of (21) and the curl itself also have intrinsic 
meanings. To be appropriately narrow-minded about this matter, let S be a 
plane triangular set or plane circular disk in E; which is bounded by an oriented 
triangle or oriented circle C. Then (21) is valid because the simpler Green 
formula shows that it is valid when the coordinate system is chosen such that 
S lies in the xy plane. The method that was applied to the plane sets of Figures 
13.391 and 13.392 can now be employed to prove that (21) holds when S is a 
triangulated oriented surface in Es consisting of a finite set of plane triangular 
faces bounded by oriented triangles provided the topological structure and 
orientations are such that if a side of a triangle 


is a part of the boundaries of more than one 
triangular face, then the side is a part of IW, 
boundaries of exactly two such faces and, as 
in Figure 13.393, the side has opposite orienta- 
tions in the two triangles that contain it. 
While a full treatment of the matter lies 


beyond the scope of this book, teachers of WAN 
courses in electricity and magnetism and aero- 


dynamics (among others) require knowledge of Figure 13.393 
consequences of the idea that the Stokes for- 
mula (21) is valid when S is a patch of surface in E; that can be satisfactorily 
approximated by a triangulated oriented surface of the type described above. 
One of the important applications of the Green and Stokes formulas involves 
conservative force fields. Let the vector V(x,y,z) in (11) be the force on a par- 
ticle when the particle is at the point (x,y,z). The force field determined by the 
vector function V is said to be conservative if the work done in moving the particle 
around a closed curve C is 0 when C belongs to a class of curves which is not 
always carefully defined but which certainly includes circles. If V is conserva- 
tive over a region in E3, then the right member of (21) must therefore be 0 when 
C is a circle in the region. Then the left member of (21) must be 0 when S is a 
plane circular disk in the region and therefore (as can be proved) the hypothesis 
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that V X V is continuous over the region implies that V X V =O over the 
region. On the other hand, if V X V = 0, then the left member of (21) is clearly 
0, so the right member must be 0 and V must be conservative This proves the 
very useful nontrivial fact that if V and its scalar components P, Q, R are con- 
tinuous and have continuous partial derivatives over a region in £3, then V is 
conservative over the region if and only if the formulas 


ðQ ðP ðR oP ôR 9Q 
dx Oy Ox Oz Oy Oz 
hold over the region. 

11 There is much to be learned about the process of Problem 10 by which the 
Stokes formula (21) is proved first for simpler surfaces composed of oriented 
plane triangles suitably joined at their edges and then for curved surfaces that 
can be satisfactorily approximated by the simpler surfaces Relatively few 
people undertake to master all of the details, but everybody can see that some 
quite delicate topological considerations are involved. Classical examples 
involve ordinary bands and Mobius bands. When the ends of a strip of paper 
a foot long and an inch wide are joined in the simplest way, the result is an 
ordinary curved band that has two edges (a top and a bottom) and two sides 
(an inside and an outside). It is easy to put a dozen diagonal creases in the 
paper to obtain a band composed of a dozen plane triangular patches. Let S, 
be the surface composed of the points on the outside of the latter band. It is 
easy to orient the triangles as in the discussion of Figure 13.393 and to obtain 
the Stokes formula for Sı. To make a Mobius band, we start with another strip 
of paper a foot long and an inch wide, but this time we put a twist (a half-turn) 
in one of the ends before the two ends are joined. ‘This strip can be creased to 
obtain a band composed of plane triangles joined at their edges. It turns out 
that the Mobius band has just one edge and just one side, there being no “‘side”’ 
that is “an outside” that is different from “the inside.” Inner secrets are 
revealed to those who try to color only “the outside” of this band. Persons 
interested in this matter may construct Mobius bands and study their properties. 
It is quite easy to obtain the correct idea that topological considerations form an 
essential part of rigorous (free from blunders) statements and proofs of theorems 

, setting forth conditions under which the 
Figure 13.41 Stokes formula is valid. 

13.4 Rectangular coordinate ap- 
plications of double and iterated 
integrals This section illustrates 
ideas that are often used when prob- 
lems are being solved with the aid 
of double and iterated integrals in- 
volving rectangular coordinates. The 
principal illustration involves a lamina 
(or flat plate) which, as in Figure 
13.41, lies in the xy plane and is 
bounded by the graphs of the equa- 
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tions y = 0, y = x, and x=]. We suppose that, at each point (x,y) 
of the lamina, the lamina has areal density (or mass per unit area) 5(x,y). 
This means that to each e > O there corresponds a 6 > 0 such that 


(13.42) 


whenever AS is the area of a part of the lamina containing the point 
(x,y) and having diameter less than ô and Am is the mass of the part. 
In the simplest applications, there is a constant k, which may be 1, such 
that 5(x,y) = k whenever (x,y) is a point of the set S occupied by the 
lamina; in this case the lamina is said to be homogeneous. While the 
ideas can be applied in some other cases, we suppose that 6 is continuous. 
Supposing that xo is a given number and that $ is a given nonnegative 
integer that is 0 or 1 or 2 in most applications, we undertake to learn the 
techniques involved in setting up three different expressions for MPa» 
the pth moment of the lamina about the line x = xp. 

To set up a double integral for M2, we chop the rectangle of Figure 
13.41 into subrectangles by lines parallel to the coordinate axes. A 
particular subrectangle, such as the one shown in Figure 13.41, has area 
Ax Ay. Supposing that the subrectangle lies entirely within the lamina, 
we select a point (x,y) in the subrectangle and use the number (x,y) Ax Ay 
as an approximation to the mass of the part of the lamina within the 
subrectangle. If this total mass were concentrated at the point (x,y), its 
pth moment about the line x = x would be 


(13.43) (x — x0)?5(x,y) Ax Ay. 


We therefore use this number as an approximation to the pth moment 
about the line x = xo of the part of the lamina in the one subrectangle. 
The sum 


(13.431) E(x — x0)?5(x,y) Ax Ay, 


which contains a term for each subrectangle in the lamina, should then 
be a good approximation to the total pth moment of the entire lamina 
whenever the diameters of the subrectangles are all small. This leads us 
to the formula 


(13.432) M? = lim D(x — xo)?6(x,y) Ax Ay, 
(p) 


the right side of which is taken to be the definition of the number Min 
which we are seeking. In accordance with the theory of double integrals 
involving (13.34) and Theorem 13.38, the right side of (13.432) is a 
double integral which we can denote by one or the other of the symbols 
in the formula 


(13.433) M? = J Í (x — x0)?8(x,y) dS = f Í (x — 20)?5(x,y) dx dy. 
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With the aid of Theorem 13.38 we can quickly express the above 
double integral as an iterated integral in two different ways. It is, 
however, worthwhile to learn to use a procedure which leads directly 
to iterated integrals without making use of double integrals. As in the 
preceding paragraph, we observe that Ax Ay is the area of a subrectangle 
and use the number 6(x,y) Ax Ay as an approximation to the mass of the 
part of the lamina within the subrectangle. Again we note that if this 
total mass were concentrated at the point (x,y), its pth moment about the 
line x = x would be 


(13.44) (x — x0)?5(x,y) Ax Ay. 

We then form the sum 

(13.441) Ax d (x — 20)?8(x,y) Ay 
z fixe 


where the part “x fixed” of the symbol serves to inform us that the sum 
contains only terms arising from those subrectangles which comprise a 
vertical strip such as that shown in Figure 13.445. When the numbers 
Ax and Ay are all small, the coefficient of Ax in (13.441) is a Riemann sum 
which is a good approximation to the coefficient of Ax in the expression 


(13.442) Ax ii (x — 2¢0)?6(x,y) dy. 


Using this as an approximation to the pth moment about the line x = xp 
of the part of the lamina in one strip, we are led to expect that the sum in 


(13.443) M®?, = lim Y Ax [7 (x — x0)?8(x,y) dy 
Zo 0 


will be good approximation to the required total moment when the num- 


bers Ax are all small and hence that (13.443) should be a valid formula. 
This gives 


1 x? 
(13.444) M®,, = I, dx I, (x — 20)?8(x,y) dy 


because the sum in (13.443) is a Riemann sum which approximates the 


integral in (13.444). 


Figure 13.445 Figure 13.446 
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A simple modification of the preceding paragraph gives an iterated 


integral in which the first integration is with respect to x. Instead of 
(13.441), we form the sum 


(13.45) Ay J, (œ — x0)8(x,y) Ax, 


y fixed 


where the part “y fixed” of the symbol informs us that the sum contains 
only terms arising from subrectangles which comprise a horizontal strip 
such as that shown in Figure 13.446. When the numbers Av and Av 
are all small, (13.45) and 


(13.451) Ay | ia © — 20)?8(x,9) dx 


are good approximations to the pth moment about the line x = xo of the 
part of the lamina in one strip, and we are led to the formula 


(13.452) M2. = lim > Ay i (x — x0)6(x,y) dx 


and hence to the formula 
1 1 
(13.453) M2., = h dy Í y (Œ — 0) Pòle,y) dx 


for the pth moment about the line x = x of the whole lamina. 

Several quite simple and obvious remarks can now be made. In order 
to obtain derivations of formulas for M? „ the pth moment of the lamina 
about the line y = yo, it suffices to replace the factor (x — xo)? by the 
factor (y — yo)? in the above derivations. In case p = 0, the factors 
(x — xo)? and (y — yo)? are both equal to 1 and the numbers Me, 
and Mn are both equal to the mass M of the lamina. Thus 


(13.454) M = f h 5(x,y) dx dy, 


and we can replace this double integral by iterated integrals. 

In case p = 1, the moments are first moments or moments of first order. 
In case xo and yo are chosen such that M{2,, =0 and M2, = 0, the 
point (xo,yo) is called the centroid of the lamina. It is customary to let 
z and # (« bar and y bar) denote the coordinates of the centroid. The 
equations which determine # and 7 then become M, = 0, MV, =0 


or, as we see from (13.433) and the similar formula for MY, 


(13.46) Ih (x — £)8(x,y) dx dy = 0, I (y — 5) êlx,y) dy = 0. 


Since # and f are constants that can be moved across integral signs, we 
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can put these equations in the form 


(13.461) % Is 5(x,y) dx dy = I xôlx,y) dx dy, 
5 ffy icy) dx dy = ffy vd(x,y) dx dy 


ao 162) z= I. x5(x,y) dx dy n I y(x, y) dx dy 
| a If, 5(x,y) dx dy É Ih 5(x,y) dx dy 


where the denominators are equal to the mass M of the lamina. It is 
sometimes helpful to know that if, as in Figure 13.463, the line x = x, 
is a line of symmetry of a homogeneous lamina, then Ma = 0 and 
hence = x9. In order to find the first moment M2, of the lamina 


about the y axis, it suffices to calculate the mass M of the lamina and 


X=Xp x 


Figure 13.463 Figure 13.464 


use the formula M% = M®,. If, as in Figure 13.464, the lines x = xo 
and y = yo are both lines of symmetry of a homogeneous lamina, then 
% = x) and ¥ = yo, so the centroid of the lamina is the point (%o,yo). 

In case p = 2, the number M? becomes 


(13.47) M®.,, = ffo (« — %0)*8(x,y) dx dy, 


the second moment or moment of inertia of the lamina about the line x = xo. 
When these things are being calculated and used in mechanics and else- 
where, information concerning moments of inertia about parallel lines 
(or axes) is very helpful. To obtain information of this nature, we let 
% be the x coordinate of the centroid and use the simple identity 
(x — xo)? = [(~ — #) + (% — x]? 
= (x — £)? + (ž — x)? + 2K — x0) (x — 7) 
to put (13.47) in the form 


(13.471) M®,, = J [, @ — Daly) dx dy + (E — x)? f h 5(x,y) dx dy 
+ 2@ — x0) | Í (x — #)8(x,y) dx dy. 


The first term in the right member is M@., the moment of inertia of the 
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lamina about the line through the centroid parallel to the line x = xo. 
The second term is (¥ — xo)?M, where M is the mass of the lamina. 
The third term is 0 because the integral is the first moment of a lamina 


about a line through its centroid. Thus (13.471) reduces to the impor- 
tant formula 


(13.472) Mr, = M2, + (% — x)?M. 


This gives the following parallel axıs theorem. 

Theorem 13.48 The moment of inertia of a lamina about a line is equal 
to the sum of two terms, one being the moment of inertia of the lamina about 
the parallel line through the centroid and the other being the product of the 
mass M of the lamina and the square of the distance between the two lines 

Up to the present time, we have considered 
only moments of plane laminas about lines in 
theplanesofthelaminas. Thesecond moment 
or moment of inertia of a lamina about a line C28) 
L perpendicular to the plane of the lamina is 
called the polar moment of inertia of the lamina 7 
about the line L. As before, let the lamina . 
cover a set S in the xy plane and let L be the Figure 13.481 
line in Æ; having the equations x = xo y = yo. Letting AS or Ax Ay be 
the area of a part of the set S which contains the point (x,y) and letting 
5(x,y) denote the density of the lamina at the point (x,y), we use the number 


(13.482) [(~ — xa)? + (y — yo)?]8(x,y) Ax Ay 


as an approximation to the polar moment of inertia about L of the part 
of the lamina. The polar moment of inertia about L of the whole lamina 
may be denoted by the symbol M2, y- It is defined by the formula 


(13.483) Moz, yyy = lim Z[(e — xo)? + (y — yo) *18(x,y) Ax Ay 


or 


(13.484) MPa yom = [fo [Ge 2)? + O — yo)16(%9) dx dy. 


œ” 
(Xo, Yo) 


Comparing this with the formula (13.47) for MẸ» and the corresponding 
formula for M‘? gives the formula 


(13.485) M? nun = Map + MED 


which says that the polar moment of inertia of the lamina about the 
line x = xo, y = yo perpendicular to the lamina is equal to the sum of the 
moments of inertia of the lamina about the two lines x = xo and y = yo 
in the plane of the lamina. Finally, we note that the parallel azis 
theorem, Theorem 13.48, holds for polar moments of inertia as well as 
for moments of inertia about lines in the plane of a lamina. 


682 Iterated and multiple integrals 
Problems 13.49 


1 A rectangular lamina has opposite vertices at the origin and the point 
(a,b), and has areal density (mass per unit area) 6(x,y) at the point (x,y). Set 
up an iterated integral for the pth moment of the lamina about the x axis. Then 
evaluate the integral for the cases 


(a) 5(,y) = 1 (b) 6@,y) = kx (c) d(%,y) = ky 


Ans.: The required integral is 


b b a 
N dx | yPb(x,y) dy or f dy f yP (x,y) dx. 


abrt! kabet! kabrt? 
p+l 2@+1) p+2 

2 Supposing that 0 < p < q, set up and evaluate an iterated integral for 
the area Æ of the region in the first quadrant bounded by the graphs of the 
equations y = xP andy = x. Ans.: 


The required moments are respectively —— 


A= om d= FIGF 


3 When0 < p < gq, the region in the first quadrant bounded by the graphs 


of y = x? and y = xt has area (q — t)/ (b + 1)(4 + 1). Find the coordinates 
of the centroid of this region. <Ans.: 


@+N)@+1 ~_ @+1G+)1) 
~(e+2)q¢+2) lT 2FED +I) 


1 


4 Find the centroid of the region bounded by the x and y axes and the 
graph of y= e7. Ans.: # = 1,9 = ¢. 

5 Find the centroid of the long golf tee obtained by rotating the region of 
Problem 4 about the x axis. Ans.:% = 4, 9 = 0,2 = 0. 

6 Find the centroid of the region in the strip 0 S x S m bounded by the x 
axis and the graph of y = sin x. Ans.:% = 27/2, 9 = 1/8. 

7 Find the centroid of the region which lies in the interval 0 S x S 2a 


and is bounded by the graphs of the equations y = 0 and y = b sin i Ans.: 


8 A vertical face of a dam is bounded by the segment 0 S x S 2a of the x 
axis and the graph of the equation y = —b sin i The water level is at the top 
of the dam, and the weight per cubic unit of the water is w. Find the magnitude 
of the force on the dam. Ans.: $wab?. 


9 Find the centroid of the region in the first quadrant bounded by the coordi- 


nate axes and the hypocycloid having the equation x% + y% = a%. Ans.: 
_ _ 256a 


x =Y Blox 
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10 Solve Problem 9 again, using the parametric equations 
x = a cos? i, y = asin? ? 


to obtain integrals involving ż. 
11 A lamina having density xy at the point P(x,y) lies in the first quadrant 
and is bounded by the coordinate axes and the ellipse having the equation 
x? y? 
2 + B= 1. 


Find its mass M and the coordinates of its centroid. Ans.: M = 4a°b?, % = 
5a, y = Tso. 

12 For the region R bounded by the positive x axis and the graphs of y = 
xe and x = a, find each at the following: 


(a) The area of R Ans.: 1 — (a+ 1)e7 
(b) The volume of the solid obtained by rotating R about the x axis 
Ans.: =n{1 — (1 + 2a + 2a?)e7*4] 
(c) The volume of the solid obtained by rotating R about the y axis 
Ans.: In[2 — (2 + 2a + a*)e74| 
(d) The first moment of R about the x axis Ans.: $[1 — (1 + 2a + 2a?)e7™4] 
(e) The first moment of R about the y axis Ans.: 2 — (2 + 2a + ae 
(f) The moment of inertia of R about the x axis 
Ans.: s7[2 — (2 + 6a + 9a? + 9ad)e-34] 
(g) The moment of inertia of R about the y axis 
Ans.: 6 — (6 + 6a + 3a? + a®)e 
(h) The polar moment of inertia of R about the line through the origin perpen- 
dicular to the plane of R Ans.: Sum of answers to (f) and (g) 


13 A triangular lamina has vertices at points Pi(%1,91), P2(x2,2), Ps(%3,9's) 
and has areal density (mass per unit area) 6(x,y) at the point (x,y). Assuming 
that the points are placed as in Figure 13.491 so that xı < x3 < xzand ye < yı < 


Figure 13.491 


y3, and letting m, denote the slope of the side opposite Px so that 


y3 — y2 ys — Jı y2 — Yı 


m =m = SB, my = 7 
x3 — Ke x3 — Xı x2 x} 


set up an iterated integral for the pth moment M? of the lamina about the 
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line x = xo Ans.: 
m _ [P poe _ 
MPa = fo dx Pee (œ — x0) B(ay) dy 
xa y2tmi(z —2x2) 
+ I. dx | (x — x0)? 5 (x,y) dy. 
14 Develop computational skill by simplifying the answer to Problem 13 


for the case in which 6(x,y) = 1 and p = 0 so the answer is numerically equal to 
the area of the lamina. Ans.: 


x1 Yi 1 
El(x1ye — xy) — (x1y3 — xay1) + (x2y3 — x3y2)] or lx: ye 1b 
x3 V3 1 


15 Figure 13.492 shows two parallel rods of lengths a and b. The rod on 
the left has linear density (mass per unit length) 6:(¢) at distance ¢ from its lower 


Figure 13.492 


end, and the rod on the right has linear density 62(u) at distance u from its lower 
end. We undertake to learn about the total or resultant gravitational force F on 
the left-hand rod that is produced by the right-hand rod. Assume that particles 
of mass mı and mz at points Pı and Pz attract each other with a force of magnitude 
Gmym2/ [PiP]? where G is a gravitational constant, and the actual force pulling 
the particle at Pı toward P2 is obtained by multiplying this magnitude by P,P,/ 
\P:P.|, the unit vector which has its tail at P, and points toward P}. Derive 
the formula 

ô (t) At z(u) Au , 

OD T lD + (u — Oi 

for the force which an element (or subset) of the rod on the right exerts upon an 
element of the rod on the left. Then derive the formula 


> d2(u)[Di + (u — 4)j] 
CaO A | Fune A 


for the force which the whole rod on the right exerts upon the element of the rod 
on the left. Then derive the formula 


_ a b b2(u)[Di + (u — ż)j] 
F =c f t) as f u ape a 


16 ‘Two identical slender rods having constant linear density 5 occupy the 
intervals —a Sx S —eand e <x S a of an x axis. It is supposed that0 < e€ 
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< a, and we can be interested in situations in which eis small. Set up an integral 
for the gravitational force which the rod on the right exerts upon the rod on the 
left. 

17 Figure 13.493 shows rods of lengths a and b that have constant linear 
density 5, that lie on the x and y axes, and that are hinged at their ends at the 


y 


(0, b) 
(0, yR 


(x,0) (2,0) x 
Figure 13.493 


origin. A little segment of the horizontal rod in a neighborhood of (x,0) pro- 
duces a little gravitational force u on a little segment of the vertical rod in a 
neighborhood of (0,y). This little force u has a little scalar horizontal compo- 
nent uz which produces a little torque (or first moment) yuz which tends to rotate 
the vertical rod toward the horizontal one. There are hordes of little torques. 
Set up an integral for the total torque. 

18 Most people having serious interest in mathematics want to see and 


perhaps study the nontrivial steps by which the important Euler gamma integral 
formula 


(1) alm [ert at (2 > —1) 


is derived from the definition of z! given in Problem 11 of Problems 3.39. We 
start with the fact that, when z is not a negative integer, 2! is defined by the 
formulas 


t= li o in 
(2) z im Fala), Fale) Z@tij2+2)°°'@t n) 
Expressing [(z + 1)@ + 2) +--+ (z+ n) as a sum of partial fractions leads 


to the formula 
nol n—1 1 
— wit — 1\k ee © 
Fata) =a ( DÒ; reer 


To put this in a form that can be simplified by use of the binomial formula, we 
use the fact that 


l — T z+k = f Za,k 
(3) zti T” du = j wl du 
when z > —1 and k =0, 1, 2,.... Assuming henceforth that z > -l, 
we find that 
1 "o n 1 n—k kd 
(4) Fa =n foe DY ) eka) du 


and hence that 


(5) Fla) = nH h u(i — u)" du. 
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Changing the variable of integration by setting u = t/(m — 1) then gives 


1\ztl pn-l i n—l 
(6) F(z) = (1 — =) Í, e(1-— :) dt 
when n > 1. Therefore 


(7) z! = lim | Ga(t) di 
n— © 0 
where 
t n—l 

(8) Ga) = #(1- 44) O<:Sn-1) 
and G,(t) = O whent 2 n. It can be shown that 
(9) lim G,(t) = te~, IGn(t)| S [e]. 

N> 0 


While full exploration of the matter lies beyond the scope of elementary calculus, 
(1) is a consequence of (7), (9), and the Lebesgue criterion of dominated con- 
vergence for taking limits under integral signs. When m and s are numbers for 
which s > m, we can putt = (s — m)x in (1) to obtain the formula 


z! “0 
(10) z——— n = Í, e™Tyte™T dx. 


(s — m)?1 


Particularly when it is recognızed that (10) is valid even when m is complex, this 


single formula (10) is the equivalent of a huge table of Laplace transforms and is 
therefore very important. 


19 We examine the formulas by which ideas of this chapter are used to start 
with the Euler gamma integral formula 


(1) z! = if tze dt 
of the preceding problem and derive the beta integral formula 


p adi = Pt _ 
(2) hea- dtg 


It is supposed that z, p, and q are complex numbers having real parts exceeding 
—1. Use of (1) gives 


(3) p4! = in xPe-* dx I ye dy 


where the right side is the product of two integrals. Writing this as an iterated 
integral and putting y = u — x give 


(4) plg! = h dx J Pyae dy = h dx f° xP(u — x)%e~4 du 
0 0 0 z 


The Fubini theorem justifies change of order of integration to obtain the first 
equality in 


6) plat = fo ew du [feu — ade = JN urtee dy f (1 — t) dt 
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and putting x = ut gives the second equality Therefore 
1 
(6) p! = (p +4 + Di (1 — i)e dt 
and (1) follows. 


13.5 Integrals in polar coordinates Insomecasesa plane set S anda 
function f defined over S are such that the double integral defined as in 


(13.37) by 
(13.51) [f, f) dS = lim Y AP) as 


can be advantageously expressed ın terms of polar coordinates p and $. 
For example, suppose that, as in Figure 13.52, S is the set of points 


Figure 13.52 Figure 13.53 Figure 13.531 


having polar coordinates p, ¢ for which g:(¢) S p S g:(¢ġ) anda S ¢ <S 
b, where gı and ge are continuous functions for which 0 < gi(¢) S g2(¢) 
when a S¢ < $. Our first step is to partition S into subsets Sı, So, 
-- +, Sn by radial lines having the polar equations ¢ = ġo ¢ = 41, 
“°°, Ġ = om, where a = ġo < ¢1< °° * < ọm = 8, and by circles 
or circular arcs having the equations p = po, p = p1, °° * 5 P = Pm’, as 
in Figure 13.52. A typical one of the subsets is a part of a sector having 
corners at the points whose polar coordinates are shown in Figure 13.53. 
We set Ap, = p41 — p, and Adz = r41 — $y and then simplify the things 
we write by discarding all subscripts to obtain the polar coordinates 
shown in Figure 13.531. The symbol AS then represents the area of the 
shaded set in Figure 13.531. This shaded set is not a rectangle, because 
the straight sides are not parallel, and the inner and outer sides are arcs of 
circles which are not parallel line segments. Itis, however, thoroughly rea- 
sonable to have the opinion that, when p > 0 and Ap and Ag are small, 
the shaded set is “nearly”? rectangular and that AS should be closely 
approximated by the product of Ap (the length of one of the straight 
sides) and p Ad (the length of the inner curved side). This suggests that 
we should be able to use p Ag Ap as an approximation to AS. Much 
more can be said about this approximation business, and there are dif- 
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ferent ways of proving that, when p > 0, 


(13.532) AS L] 


im 
Ap—0, 26-0 PA p Ad 


It is, in fact, easy to work out an exact formula for AS, because AS is 
the difference of the areas of two sectors having central angle Ad. The 
larger sector has radius p + Ap and the smaller sector has radius p, so 


AS = $(p + Ap)? Ad — $o? Ad = H(2p Ap + (Ap)*) Ad 


and hence 


(13.533) As = € 4 2) Ag Ap. 


This can be used to prove (13.532), and it can also give us another idea. 
If we let p* = p + Ap/2, then we obtain the exact formula 


(13.534) AS = p* Ad Ap. 
One who wishes to do so may insist that the formula 
(13.535) AS = p Ad Ap 


is an exact formula obtained by setting Ap, = p,+1 — Pn Adz = k41 — dy, 
p* = p, + 4(p,11 — p), and discarding all subscripts and stars. When we 
put f(P) in the form f(p,¢), where p and ¢ are polar coordinates, we are 
therefore able to put (13.51) in the form 


(13.54) Jfa Kordo dé dp = lim } f(p,6)p A dp. 


Assuming that f is bounded over S and is sufficiently continuous to make 
all of the integrals exist, we can use Theorem 13.38 to express this double 
integral as an iterated integral. For example, when S is the set featured 
in Figure 13.52, we observe that the point having polar coordinates 
(p,¢) lies in S when a S ¢ S £ and, for each such ¢, g1(¢) S p S g:(9), 
SO 


(13.55) J|: f(0.#)0 do dp = f do [ES f(0,8)0 do. 


91(¢) 


Except in cases where S is a circular sector (which may bea whole circular 
disk) or the difference of two circular sectors (which may be a whole 
circular ring), it is usually not convenient to use formulas of the form 


(13.551) ff, Fedo de dp = f; do [ES Kodo de 


in which the first integration is with respect to ¢. The following three 
examples serve to show how double and iterated integrals in polar coordi- 
nates can be set up. It is not recommended that the formulas be 
remembered; we reconstruct them whenever we want to use them. 
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Example 1 Let0 Sa<6 Sr, let f(¢) Z 0 when a S$ < B, let f 
be Riemann integrable over the interval æ < ọ <S 8, and let S be the 
plane set of points having polar coordinates (p,) for which a < ¢ < B 
and 0 Sp Sf(¢). In case f is continuous, S is the set bounded by the 
polar graphs of the equations ¢ = a, $ = B, and p = f(¢). The problem 
is to find the volume |B| of the solid body B that is “generated” by rotat- 
ing the set S about the initial line (or x axis) of Figure 13.56. The first 


Figure 13.56 


step 1s to partition S into subsets by radial lines through the origin and 
circular arcs having centers at the origin. Letting p and ¢ denote the 
polar coordinates of a point in a typical subset, we use the number p Ap Ad 
as an approximation to the area of the subset. When the subset is 
rotated about the x axis, it generates a solid which may be thought of as a 
ring or hoop or gasket having radius p sin ¢ and length 2rp sin @. The 
number 
(2rp sin $)(p Ap AQ), 
or 
2rp? sin @ Ap Ad, 


being the product of the length of the ring and the area of a cross section 
of the ring, is then used as an approximation to the volume of the ring. 
The sum in 


(13.561) |B| = lim Z2mp? sin ¢ Ap Ad 


is then used as an approximation to the volume |B| of the body B, and 
the limit is (without proof) taken to be the exact volume |B|. The 
right member of (13.561) is a double integral. Expressing this as an 
iterated integral gives 


(13.562) Bl = [f do [0° 2mo sin $ dp 
or 
(13.563) IB] = 2r [$ sino do fi” odo 
or 


(13.564) BI =% | f EET sin $ dé. 
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It is particularly easy to evaluate this integral for the special case in 
which there is a constant R for which f(¢) = R whene Sẹ <B. In 
this case, S is a circular sector and 


(13.565) IBI = = [cos æ — cos £]. 
In case a = 0 and 8 = a, the solid body B is a complete spherical ball of 
radius R and the right side reduces to the correct volume $7R?. 
Example 2 Leta < 6 £ a + 2r, let f(¢) 2 Owhena S ¢ < £, let f 
be Riemann integrable over the interval a S ¢ < $, let S be the plane 
set of points having polar coordinates (p,¢) for which a S ¢ S £ and 
0 S p S f(¢), and let a lamina (or flat plate) cover S and have areal 
density (mass per unit area) 6(p,¢) at the point (p,¢) of S. The problem 
is to find the pth moment of the lamina about the y axis shown in Figure 


Figure 13.57 


13.57. As before, we use p Ap A¢ as an approximation to the area of a 
subset of S. The next step is to use 


5(p,¢)p Ap Ad 


as an approximation to the mass of a part of the lamina. Multiplying 
this by x? or (p cos ¢)?, the pth power of the x coordinate of a point in 
the part, gives an approximation to the pth moment about the line x = 0 
of the part of the lamina. This leads to the formula 


M2, = lim E(p cos $)?85(0,¢)p Ap Ad 


for the required pth moment. The right member is a double integral, 
and expressing it as an iterated integral gives 


B S(¢) 
M2, = [Í cos o de [2° 8(0,4)0°#! dp. 


Example 3 We now require an integral for the polar moment of 
inertia Mm... yew Of the lamina of Example 2 about the line L through the 
Origin perpendicular to the plane of the lamina. We use p Ap Ag as 
an approximation to the area of a subset of the lamina and then use 
5(p,6)p Ap Ad as an approximation to the mass of the subset. Multi- 
plying this by p?, the square of the distance from the line L to the subset, 
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gives the approximation 


(13.58) 5(p,$)p* Ap Ad 
to the polar moment of the subset. This leads to the formulas 


Mme yey = lim 25(p,)p? Ap Ad 


and 


B J(e) 
Men =y ~ f. d¢ |; ° 5(p,$)p? dp. 


Problems 13.59 


1 Set up an iterated integral in polar coordinates for the volume ~ of the 
solid generated by rotating, about the y axis, the triangular set bounded by the 
lines having the polar equations ¢ = 0 and @ = B (where 0 < B < a/2) and 
the line having the rectangular equation x = A (where h > 0). Then evaluate 
the integral and discover that V = a h? tan b Remark: Correctness of the 
answer can be verified by use of elementary geometry, because the solid is 
obtainable by removing a part of a solid right circular cone from a segment of a 
solid right circular cylinder. 

2 Find the distance from the vertex to the centroid of a lamina having the 
foim of a circular sector of radius R and central angle 2a when each of the fol- 
lowing is true. 


(a) The lamina is uniform. Ans.: 5 sin æ R 

(b) The density is proportional to kth power of the distance from the vertex. 
k+2sine 

Ans.: 43 a R 


3 Using the equation p = 2a cos ¢, set up and evaluate an iterated integral 
in polar coordinates for the moment of inertia Jp of a circular disk about a line 
perpendicular to the disk and containing a point on the boundary of the disk. 


Ans.: 
h=? f M do fr r? 03 dp = Brat. 


4 Using the equation p = a, set up an iterated integral in polar coordinates 
for the moment of inertia Io of a circular disk about the axis of the disk (the line 
through the center of the disk perpendicular to the plane of the disk) when the 
density of the disk at each point is the pth power of the distance from the point to 
the diameter on the initial line (or x axis). Ans.: 


I = afr sin? $ do | p?t? dp. 


Remark: With the aid of the formula 
2 ° 2 ° 


a/2 , 1 
Í sin? x cos? x dx = = ——————— 
0 2 


CH) 
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appearing among the problems of Section 8.4, and the fact that (—$)! = +/g, 
we can put the result in the form 


T). 
Aarts fx/2 | > 2 /m artt 2 
l= 5 F4jo sinrddd= Gre (2): 


which is valid even when 9 is not an integer. 
5 A circular disk has radius a and constant density ô. Find its moment of 
inertia about each of the following: 


(a) A diameter (line, not number) Ans.: frat 
(b) A tangent in its plane Ans : ra'd 


6 Find the volume of the solid S obtained by rotating, about the x axis, 
the region in the first quadrant bounded by the x axis and the graph of the polar 
equation p = a +/cos ġ. Outline of solution: The volume |S| of Sis approximated 
by the sum of “‘elementary” rings a sample one of which has cross-sectional area 
pz Ap; Ad, and length 2p; sin dy This leads to the formula 


[S| = an fo" sin ġ dọ N cose 2 dp. 
0 0 
Ans.: 4ra*/15. 


7 Supposing that the solid S of Problem 6 has constant density 6, find the 
gravitational force F which it exerts upon a particle m* of mass m concentrated 
at the origin. Outline of solution: The sample elementary ring of the preceding 
problem has mass Mp, where 


M, = 215 sin prp Ads Ape. 


The force AF, which this ring exerts upon m* is the same as the i component of 
the force exerted upon m* by a single particle of the same mass Mg concentrated 
at the point having polar coordinates (pr or). Therefore, 


AF, = ; om Ms COS dy. 
Pk 


This leads to the formula 


F = 2rGmôi f° sin @ cos $ I 008 © dp. 


The answer is F = $raGm6i. Remark: We embark on a little excursion to see 
that the solid S of this problem and the preceding one is a most remarkable solid. 
If a particle P of mass M is located at the point in our plane having polar coordi- 
nates (p,@), or at a point which is obtained by rotating it part of the way around 
the x axis, then the scalar component F, in the direction of the x axis which it 
exerts upon m* is 


Fz = =e cos ¢. 


If P lies inside our solid S, then p < a ~/cos ¢, so p? < a? cos g, so (cos ¢)/p? > 
1/a*, soF, >GmM/a*. If P lies outside our solid S, then Fz <S 0 if r/2 < |¢| < 
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nxn and p>av/cos ¢ if |¢| <a/2. It follows that if P lies outside our solid, 
then F <GmM/a*. These results show that if we transfer material from the 
inside to the outside of our solid, we decrease the force which pulls m* in the 
direction of the positive x axis. This fact and the answer to Problem 6 give the 
following interesting conclusion. Of all solids having volume 4ra?/15 and uni- 
form density ô, our solid S is the one and only solid which exerts, upon a particle 
at the origin, the greatest force in the direction of the positive x axis. The force 
Fz exerted upon m* by a spherical ball B which has the same volume and density 
as S, and which has its center on the positive x axis and has the origin on its 
surface, is the first of the vectors 


Fg = 4/25 psraGmbi, Fs = 3a4sraGmii, 


and the force exerted by our solid S is the second one. Thus we have a new 
proof of the inequality 4/25 < 3. Tables say that 


4/25 = 2.924018. 


Spherical balls are not the best, but the best is only about 2.5 per cent better. 

8 Partly because the result is thoroughly important in probability and 
statistics and elsewhere, and partly because understandings of multiple and 
iterated integrals should be developed, this problem requires learning a standard 
method by which the formulas 


a) h e> dx -XE I. 


e= dx = 1 


1 co 
g7z?l20? dy = — 222] 2g? = g? 
(2) =i, dx = 1, a J dx = 0 


are derived. Supposing that h > 0, we define F (4) by the first of the two equiva- 
lent formulas 


h h 
(3) Fh) = | ede, Fh) = h d. 
Then 


(4) [F(A)}? = | h r dx | | h gi dy | = ih dx h gar) dy 
= f Io QCh) e+?) dx dy 


where we have, in order, the product of two integrals, an iterated integral and, 
finally, a double integral over the square region Q(h) of Figure 13.591. This 


Figure 13.591 


h hy? x 


694 Iterated and multiple integrals 


turns out to be useful because the double integral can be compared with other 
double integrals that are easily evaluated by use of polar coordinates. Let 
D(h) be the quadrant of the circular disk consisting of points having polar coordi- 
nates (p,@) for which O S$ p S handO S @ < 7/2 and let 


(5) G(h) = f Í Da) e+") dx dy. 


Then, because the integrands are everywhere positive, 


(6) GCh) < [F@P £ Gh V2). 


Writing (5) in terms of polar coordinates and evaluating the result by use of an 
iterated integral gives 


(7) GOH) = [fom pdh dp = i d f * p dp 
= f aiebo =F- e 


Since F(h) > 0, this and (6) give 
— h 
(8) MV -e S Í e= dx S < VE Vinee — e 


Taking the limit as h —> œ gives the first formula in (1), and the second formula 
in (1) follows from the first. The first formula in (2) is obtained from the second 
formula in (1) by a change of variable; the trick is to set x = t/+/2 o and then 
replace ¢ by x in the new integral. The second formula in (2) is obtained by 
integrating by parts and using the first formula in (2). Remark: The formulas 
obtained by replacing x by x — M in (2) are important. The function ® defined 
by 
1 _ (1-107 
(9) D(x) Vine e 


is the Gauss probability density function having mean (or average) M and standard 
deviation o. In appropriate circumstances, the number 


(10) ih B(x) dx 


is, when a < b, used for the probability that a number x (which could be the num- 


ber of red corpuscles per cubic centimeter in your blood) lies between a and b. 
The formula 


M) 


M +20 - eo i N 
e * dx = —— f e?l2 dt, 
~Ir = \/ de 70 


which is proved by use of the substitution (x — M)/o = 1, facilitates calculations 
of probabilities because the last member is tabulated as a function of X. Many 
students of anthropology, medicine, education, agronomy, and other branches 


(11) fe bt) dt = 
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of agriculture are required to study analytic geometry and calculus so they can 
start learning about these things. 


13.6 Triple integrals; rectangular coordinates We have par- 
titioned sets in Æ, and E% into subsets and used these partitions in the 
process of setting up Riemann sums and Riemann integrals of functions. 
One reason for the importance of what we have done lies in the fact that 
the methods are easily extended to provide information about triple in- 
tegrals of the form 


(13.61) JSA AP as 


in which S is a set in three-dimensional space E; in which it is sometimes 
convenient to suppose that we exist. 

Let S be a bounded set in Æ; which may be a spherical ball (the set 
consisting of the points inside and the points on a sphere) or any other 
bounded set in E; which has a positive volume |S|. As was the case 
when we defined double integrals, we do not allow any one brand of 
coordinates to dominate our work. We suppose that we have a bounded 
function f defined over S and use the symbol f(P) to denote the value 
of fatP. For example, f(P) could be the density (mass per unit volume) 
at P or the product of the density at P and the specific heat at P and the 
temperature at P. The first step in our approach to a Riemann sum is 
to make a partition Q (again the letter P has been preempted) of the set 
S into n subsets Sı, Sa ° °°, Sn. The only thing we require of the 
sets Si, So, © © © , Sn is that they be nonoverlapping, that their union 
be S, and that each one of them have positive volume. The notational 
transition from Riemann sums to Riemann integrals is facilitated by 
denoting the volumes of the sets S,, S2, * © © , Sn by the symbols AS, 
ASo, «+ © ,ASn. Thus, foreach k = 1,2,°--°,2, the symbol AS; does 
not stand for a part of the set S; it stands for the volume of a part of S. 
For each k = 1,2, ° °° , n, let Px be a point in the set Sx. The number 
RS (Riemann sum) defined by 


(13.62) RS = ) f(Px) AS, 
= k=l 


is then a Riemann sum formed for the function f and for the partition 
Q of S. The norm |Q| of the partition Q is, as in previous cases, the 
greatest of the diameters of the subsets. If there is a number J such that 
to each e > 0 there corresponds a 6 > 0 such that 


(13.621) |7- ) f(Px) AS: | < € 
k=1 
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whenever the sum is a Riemann sum formed for the function f and for a 
partition Q of S for which |Q| < ô, then f is said to be Riemann integrable 
over S and I is said to be the Riemann integral of f over S. The integral 
is usually denoted by the symbol 


(13.622) J j I, f(P) dS. 


The integral is called a triple integral, and the three integral signs serve 
to remind us that S is a three-dimensional set, that is, a set in £3 having 
positive volume. As in previous cases, it is a convenience (and sometimes 
also a source of confusion) to introduce the notation of limits and write 


(13.623) J [ I, AP) dS = lim, p f(Py) AS: 
or 
(13.624) [[[,f®) dS = lim > f@®) as. 


The following theorem, which is analogous to Theorem 13.38, is very 
useful. 

Theorem 13.63 If S is a subset of a region R consisting of points 
(x,y,z) for which a, Sx S as, bi Sy S ba, c1 S 2 L ca, of f(x,y) = 0 
when (x,y,z) is a point in R but not in S, and if the eight integrals 


n = fff 1y 4s, I, = | J n f(y) dS 

In = [| dx [Pa [T iey) u= [Ë dx [7 d fe flxyy2) dy 
Is = h dy [dx [| feya dy h= fe dy f, de flasyya) dx 
Ir = [| da | ax fe fleys) dy, Ie = fld fo dy [ Flea) ax 


all exist, then 
L = I = L = l, = Is = lg = I = Ts, 


that is, the eight integrals are all equal. 

Remarks analogous to those following Theorem 13.38 apply here. 
Proof of the theorem lies far beyond the scope of this course. We can 
be content with a hazy understanding of the fact that the triple integrals 
I, and I; will exist if f is bounded and the set D of discontinuities of f 
has volume (three-dimensional Lebesgue measure) 0. So far as ele- 
mentary applications to elementary problems are concerned, we can be 
sure that if the set S and the function f are bounded, then all of the integ- 
rals appearing in the theorem must exist and must have the same value. 

To develop a technique for setting up triple and iterated integrals, 
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z z= f(x,y) 
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we consider an example. As in Figure 13.64, let S be the set or solid 
bounded by the surfaces having the equations z = filx.y), 2 = fo(x,y), 
y = gi(x), y = g(x), x = ay, and x = a. We suppose that, at each 
point (x,y,z) of the solid, the solid has density (mass per unit volume) 
6(x,y,z). ‘This means that to each e > 0 there corresponds a ô > O such 
that 


(13.641) (x,y,z) — ae <e 

whenever AS is the volume of a part of the solid containing the point 
(x,y,z) and having diameter less than ô, and Am is the mass of the part. 
In case there is a constant & such that 6(x,y,z) = k whenever (x,y,z) is a 
point of S, the solid is said to be homogeneous. While the ideas can be 
applied in some other cases, we suppose that all of the functions which we 
have introduced are continuous. Supposing that xo is a given number 
and that # is a nonnegative integer that is 0 or 1 or 2 in most applications, 
we set up integrals for M%”,,, the pth moment of the solid about the plane 
x = xo Using rectangular coordinates, we slice S into subsets by planes 
parallel to the coordinate planes. A typical subset, such as one shown 
in Figure 13.64, has volume Ax Ay Az. Letting (x,y,z) be a point in the 
subset, we use the number 


(13.642) (x,y,z) Ax Ay Az 


(the product of mass per unit volume and volume) as an approximation 
to the mass of the subset. If the total mass of the subset were concen- 
trated at the point (x,y,z), its pth moment about the plane x = xo would 
be 


(13.643) (x — x0)?5(x,y,z) Ax Ay Az. 
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We therefore use this number as an approximation to the pth moment 
of the subset. The sum 


(13.644) x(x — x0)?8(x,y,z) Ax Ay Az, 


which contains a term for each subset, should then be a good approxima- 
tion to the total th moment of the whole solid whenever the diameters 
of the subsets are all small. This leads us to the formula 


(13.645) M®,, = lim B(x — xo)?6(x,y,2) Ax Ay Az, 


the right side of which is taken to be the definition of the number M? 
which we are seeking. In accordance with the definition of triple integ- 
rals, the right side of (13.645) is a triple integral which we can denote by 


one or the other of the symbols in the formula 
(13.646) M2, = J f _ (x — %0)?8(,y,2) dS 
= fs (x — xo) PS (x,y,z) dx dy dz. 


With the aid of Theorem 13.63, we can undertake to express the triple 
integral in terms of iterated integrals in various ways. It is, however, 
worthwhile to learn to use a procedure which leads directly to iterated 
integrals. As above, we build up the expression 


(13.65) (x — x0)?5(x,y,z) Ax Ay Az 


to serve as an approximation to the required moment of a single subset. 
We then form the sum 


(13.651) Ax Ay ` : (x — x0)?5(x,y,2) Az, 


z,y hxe 


where the part “x, y, fixed” of the symbol serves to inform us that the 
sum contains only terms arising from those subsets which comprise a 


single vertical column such as that shown in Figure 13.64. Thus (13.651) 
and the number 


(13.652) Ax Ay oii (x — xo)Pô(x,y,2) dz 


are approximations to the required moment of one column. Next we 
form the sum 


S2(z,y) 
13. i ~ 
(13.653) ax dy dy fn (£ — xe) 0(x,ys2) dz, 
where the part “x fixed” of the symbol serves to inform us that the sum 
contains only terms arising from columns that comprise a slab running 
from the cylinder on which y = y(x) to the cylinder on which y = y(x). 
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Thus (13.653) and 


g2(x) J2(z,y) 
(13.654) Ax dy Jn 


g1(z) 


(x — x0)?5(x,y,z) dz 
are approximations to the required moment of one slab. F inally, 


(2) S2(z,y) 
(13.655) dA he d y ree (x — xo)?ô(x,y,z) dz 


is an approximation to the required moment of the whole solid and 
replacing this by its limit gives the formula 


g? ) 4 S2(z,y) 
(13.656) MPa = f° dx [v4 y fe (x — x0)?8(x,y,z) dz 


for the required pth moment of the solid about the plane x = xo. 

Remarks very sımilar to those following (13.453) can now be made. 
Formulas for the pth moments of the solid about the planes y = yo 
and z = zo are obtained by replacing the factor (x — xo)? by the factors 
(y — yo)? and (z — zo)? in the above derivations. In case p = 0, the 
factors (x — xo)?, (y — yo)?, (z — zo)? are all 1 and the numbers M? 
M? p Miz, are all equal to the mass M of the solid. Thus 


(13.66) M = I ô(x,y,z) dx dy dz, 


and we can replace this triple integral by iterated integrals. Formulas 
very similar to (13.46) show that the formulas M®, = 0, M2, = 0, 
M32, = 0 which determine the centroid (%,,Z) of S can be put in the 
form 


z- T, xôlx,y,z) dx dy dz o I yò(x,y,z) dx dy dz 


Ih 5(x,y,2) dx dy dz = ffl, (x,y,z) dx dy dz 


is 26(x,y,2) dx dy dz 


f f h 5(x,y,z) dx dy dz 


where the denominators are equal to the mass of the solid. 

For some purposes, the polar moment of inertia of a solid about a line 
L is of importance. When the line L is the line having the equations 
x = xo y = yo, we may denote the polar moment of inertia about L 
by the symbol Me, ja, OF J2es.y-v, and work out the formula 


(13.68) Mize = Lemmvevs 

= fff 1 — x)? + (y — yo)5(x.v.2) dx dy dz 
which is analogous to (13.484). The formula 
(13.681) M2 yy, = M®,, + M%,, 


(13.67) 
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and the parallel axis theorem (Theorem 13.48) hold for solids as well as 
for lamina. 


Problems 13.69 


1 Set up and evaluate a threefold iterated integral for the volume V of the 
solid tetrahedron bounded by the coordinate planes and the plane having the 
equation 


an a an 
a + b + c 1, 
it being assumed that a, b, c are positive numbers. Ans.: 
T -74 
y = NE pE- dy S a7) dz = tabc. 
0 


2 Supposing that f is continuous over the tetrahedron T of the preceding 
problem, set up a threefold iterated integral equal to the triple integral 


J pf 952) dx dy dz, 


where, as is often done when rectangular coordinates are involved, dx dy dz is 
written instead of dT. Ans.: Same as answer to preceding problem except that 
the integrand is f(x,y,z) instead of 1. 

3 Set up a threefold iterated integral for the volume V of the solid in £} 
bounded by the parabolic cylinders having the equations y = x? and x = y? 
and the planes having the equations z = Qandx+y+2z2=2. Ans.: 


V= haf o h a 
0 a? 0 


4 A homogeneous cube has density 6 and has edges of length a. Find its 
moment of inertia about an edge. Ans.: $5a5 or $Ma*, where M is the mass 
of the cube. 

5 A long solid circular cylinder S of radius b has its axis on the y axis of an 
x, y, z coordinate system. A circular hole having radius a and having its axis 
on the z axis is drilled. Supposing that 0 < a S b, set up an integral for the 
volume V of the part of § that is drilled away. Ans.: Because of symmetry, 
V is 8 times the volume of the part in the first octant and 


_ a /a?~ x? b= 2! 
y = 8 |, dx f dy i dz. 


6 Let g be a nonnegative constant and let S be a spherical ball of radius R 
whose density is proportional to the gth power of the distance from the center. 
Taking the origin at the center of the ball, set up a triple integral for the polar 
moment of inertia of the ball about the z axis. Simplify matters by using the 


fact that the total moment is 8 times the moment of the part of the ball in the 
first octant. Ans.: 


R JRi—y: f 4/Rt—z3—yt 
Bf D [VP VPM e H yt + A ae 
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Remark: Section 13.8 will enable us to avoid this and some other unpleasant 
integrals. 


7 The tetrahedron (or pyramid) bounded by the planes x = 0, y = Q, 
z = 0, and 


X ee A 
atto! 


has density (mass per unit volume) ô(x,y,z) at the point (x,y,z). Supposing 
that a, b, c are positive constants and that p is a nonnegative constant, set up two 
different iterated integrals for the pth moment MP, of the solid about the plane 
x =0. Ans.: One of the possibilities is 


MPa = k x? pend dy pEi) ô (x,y,2) dz. 


8 For the case in which ô(x,y,z) = 1, so that the solid is homogeneous, show 
that the last formula of Problem 7 can be put in the form 


M2, = <s xP (1 — Ey dx. 
0 a 
At least when p = 0 and p = 1, show that 


| 1 
MP = _ Pt tthe = aH 
GE’ -GtHet+DetH” © 
Finally, show that ž = a/4. 
9 LetO <bSa. A spherical ball of radius a has its center at the origin. 
Set up a threefold iterated integral for the volume V of the part of the ball drilled 
away when a bit of radius drills a cylindrical hole centered on the line having the 


equations x = 0, y = a — b. Symmetry may be used, and the integral need not 
be evaluated. Ans.: 


a 
yaa" dy 


10 Show that when b = a, the answer to Problem 9 reduces, as it should, to 
an integral for the volume of the whole spherical ball. 

11 Assuming that the spherical ball of Problem 9 has density 6(x,y,z) at 
P (x,y,z), modify the integral of the preceding problem to obtain an integral for 
the amount by which the drilling of the hole decreases the polar moment of inertia 
about the z axis. Ans.: Multiply the invisible 1 preceding dz by the factor 
5 (x,y,z) (x? + y?). 

12 Modify the answer to Problem 9 to obtain an integral for the volume of the 
material drilled from the ball when the hole is centered on the line which meets 
the y axis where the surface of the ball does. 

13 Supposing that a > 0, evaluate 


Ulpr® 


[OO te [NOE 
0 0 
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where S is the cube having four of its vertices at the points (0,0,0), (2,0,0), 
(0,2,0), (0,0,2). Ans.: 


$(1 + 3a)? log (1 + 3a) — $(1 + 2a)? log (1 + 2a) + Ẹ(1 + a)? log (1 + a). 


14 Supposing that a, b, c are positive numbers, evaluate 


IY, sin fx (z +3 + =) aT, 


where T is the tetrahedron having vertices at the points (0,0,0), (2,0,0), (0,4,0), 
(0,0,c). Ans.: (m? — 4)abe/2x?. 


13.7 Triple integrals; cylindrical coordinates In some cases a 
solid S (or set S in Æ; having positive volume) and a function f defined 
over S are such that the triple integral defined as in (13.624) by the 
formula 


(13.71) JJA; 1P) dS = lim > AP) as 


can advantageously be expressed in terms of cylindrical coordinates p, 
$, z. When we use cylindrical coordinates, the set S is partitioned into 


subsets Sı, S2, © © © , Sn by planes through the z axis having cylindrical 
equations ¢ = ġo $ = $1, °° * , $ = $m, by circular cylinders having 
cylindrical equations p = po, p = pi, © ° © , P = pm’, and by planes parallel 


to the xy plane having the cylindrical equations z = x, 2 = 2, ° °°, 
Z = Zw’. Figure 13.72 shows a typical subset containing a point having 


he iÁ wee one tase oe 
/ 
/ 


Figure 13.72 


cylindrical coordinates p, ¢, z. This subset has height Az and, as we 
learned when studying polar coordinates, its base has area exactly or 
approximately equal to pA@ Ap. Thus we use the formula 


(13.73) AS = pAd Ap Ax 
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to put (13.71) in the form 


(13.74) Jff fotado dọ dp dz = lim » f(p,4,2)p Ad Ap Az. 


Assuming that f is bounded over S and that f is sufficiently continuous to 
make all of the integrals exist, we can use Theorem 13.63 to express this 
triple integral as an iterated integral. When limits of integration for 
iterated integrals are being determined, information obtained by looking 
at Figure 13.72 can be helpful. Adding subsets for which z varies (p 
and ¢ being fixed) yields a vertical column. Adding columns for which 
p varies (¢ being fixed) yields a whole or a part of a wedge which in some 
cases looks like a conventional wedge of a cake or orange or lemon. 
Adding the wedges obtained for appropriate values of ¢ then gives the 
entire solid S. Results of performing summations and integrations in 
different orders are easily described. For example, adding subsets for 
which ¢ varies (a and p being fixed) yields all or part of a circular hoop 
or ring, and there are two ways in which these hoops can be added to 
yield more extensive parts of S. 

Supposing that S is a right circular cylindrical solid bounded by the 
graphs of the equations p = R, z = 0, and z = H and that the density 
(mass per unit volume) at the point having cylindrical coordinates 
(p,%,2) is 5(p,,z), we set up an integral for the polar moment of inertia 
M20 Of S about the x axis. For the volume AS of a subset of the 


Figure 13.75 


solid, which is shown in Figure 13.75, we use the formula 
(13.76) AS = p Ad Ap Az. 


To get the mass AM of the subset, we multiply by the density (mass per 
unit volume) 6(p,¢,z) to obtain 


(13.761) AM = 8(p,¢,2)p Ad Ap Az. 
Then we must realize what we are trying to do and multiply this by 


(13.762) z2 + (p sin $)‘, 
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the square of the distance from the subset to the x axis, to obtain the 
expression 
(13.763) [z2 + p? sin? $]6(p,,z)p Ad Ap Az 


for the polar moment of the subset about the x axis. Adding these and 
taking the limit of the sum gives 


(13.764) Moo = | | I, [22 + p? sin? $]5(p,¢,2)0 dé dp dz. 


This is, as it should be, the sum of the moments of inertia of S about the 
two planes z = Qand y =0. The limits of integration being determined 
with the aid of Figure 13.75, we can obtain the iterated integral formula 


(13.77) M240 = h de fy d h [z? + p? sin? 6]8(p,4,2)p dz. 


In order to be able to evaluate this integral in decimal form, we must 

know R, H, and 8(p,¢,z). For the special case in which g is a nonnegative 

constant and 8(p,¢,z) = kp, we can evaluate the integrals in terms of 
R, H, q, and & to obtain 

H? Rat2 Rati 

2 — De nms —— |. 

(13.78) M2, wo = Å E 7742 + rH 7+ z| 


The result for the case in which the cylinder is homogeneous is obtained 
by setting g = 0. 


Problems 13.79 


1 Supposing that 0 < b S a, set up and evaluate a threefold iterated integral 
in cylindrical coordinates for the volume V of the solid lying inside the sphere 
and cylinder having the cylindrical equations p? + z? = a? and p = b. Ans.: 


y= F dọ fy odo fy” dz = frfa? — (a? — b338]. 


2 Supposingthat0 < b < a, set up and evaluate a threefold iterated integral 
for the mass M of the solid lying inside the sphere but outside the cylinder having 
the cylindrical equations p? + z? = a? and p = b, it being assumed that the 
density of the body at P(p,¢,z) is |z|. Ans.: 


x/2 a V/ a?— p? 
m=8f d f pdo f, "ads = (a — BA 


3 Set up and evaluate a threefold iterated integral in cylindrical coordinates 
for the volume V of the solid bounded by the sphere and cylinder having the 
cylindrical equations p? + z? = a? and p = a cos ġ. Ans.: 


y =a [ag [S pap [YO a 


x/2 2 
= £43 — gin3 = 4 wed. 
$a f (1 — sin? ġ) dọ = Fa? É 5| 
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4 Assuming that the solid cone shown in the upper part of Figure 13.791 
has density (mass per unit volume) 5(p,¢,2) at the point having cylindrical coordi- 
nates (p,¢,z), set up an iterated integral for the moment of inertia of the cone 


Figure 13.791 


about the x axis. Then calculate the required moment for the case in which the 
density is proportional to the gth power of the distance from the axis of the cone, 
that is, 6(p,9,z) = kp?. Ans.: The integral is the same as that in (13.77) except 


that the lower limit of integration with respect to z is (H/R)p. The required 
moment is 


2r T 
k la EDGES BRO + GE HG ES t] 


5 Show how the preceding problem gives the conclusion that the moment 
of inertia of a homogeneous conical solid, having density 6 and height H and 
base radius R, about a line through the vertex perpendicular to the axis is 
gowR*H (4H? + R26. 

6 A solid cylinder having constant density 6 is bounded by the cylinder 
and planes having the equations p = a, z = 0, and z = h. Set up and evaluate 
a threefold iterated integral in cylindrical coordinates for the moment of inertia 
I of the solid about the x axis. Ans.: 


=f" fi d f e+ 2 sin? ọ) dz = mlati . 
= o PJ PPh & p? sin? @) dz = ra 3 4 

7 A conical solid has height h, base radius a, and density kz, where k is a 
constant and z is the distance from the base. Find the mass M of the solid 
and the distance Z from the base to the centroid. Ans.: M = pPywath?k, z = $h. 

8 A conical solid has height h, base radius a, and density kp, where k is a 
constant and p is the distance from the axis. Find the mass M of the solid and 
the distance Z from the base to the centroid. Ans.: M = yra*hk, i = gh. 

9 A cup-shaped solid S is obtained by rotation about the z axis of a region 
Rin the yz plane bounded by the graphs of the equations 


z= x, g=x +l, z = 10. 


The density (mass per unit volume) of S at the point having cylindrical coordi- 
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nates p, $, zis 6(p,¢,z). Set up a threefold iterated integral in polar coordinates 
for the total mass M of the solid. Ans.: 


2x 3 pt-+1 2x v10 J pe 
M =|, dé f p dp f 5(p,$,2) du + fi d |. p ap p Ô (x,5,2) dz. 


10 As we near the end of our textbook, we can and should review and sum- 
marize some of our ideas about integrals. This problem does not require us to 
produce a specific numerical answer to a specific problem; it requires us to thinh 
in general terms about methods by which such answers are produced. With the 
understanding that the ideas have applications to more complicated situations 
as well as to simpler ones, we consider the gravitational force F exerted upon a 
particle of mass m at a point Qin E; by a body B. This body B may be one- 
dimensional, that is, it may be concentrated upon a one-dimensional set S which 
may be a line segment or an arc of a curve having positive length. In this case 
we suppose that the body has linear (or one-dimensional) density 6(P) at the 
point Pin S. The body B may be two-dimensional, that is, it may be concen- 
trated upon a two-dimensional set S which may be a circular dish or some other 
region (on a plane or curved surface) which has positive area. In this case, we 
suppose that the body has areal (or two-dimensional) density 6(P) at the point 
P in S. Finally, the body B may be an ordinary three-dimensional solid body, 
that is, it may occupy a set S in £3 having positive volume. In this case we 
suppose the body has ordinary (mass per unit volume) density 6(P) at the point 
P in S. We simplify and unify our discussion of these things by considering 
length to be one-dimensional measure, area to be two-dimensional measure, and 
volume to be three-dimensional measure. Thus we handle all of our examples 
together by saying that we have a body B occupying an n-dimensional set S in 
E; having positive n-dimensional measure |S| and that the body has n-dimensional 
density 6(P) at the point P in S. The integer n may be 1 or 2 or 3. To start 
the process of calculating F, we make a partition of the set S into g (note that n 
has been preempted) measurable subsets Sı, So, © © © , Sẹ It would be thor- 
oughly reasonable to denote the measures of these sets by |Sil, [Sal, -- - , 
[S,|, but we find it convenient to denote the measures by AS,, AS, - - - , AS,. 
Thus, for each k, AS; is not a part of S; it is the measuref of a part of S. The 

next step is to select a point P, (or P*) in S} Note 

ee that Px is not in AS, but that P, is a point in the set 

oo of which AS, is the measure. We use the number 
6(P;,) AS; to approximate the mass of the part of the 

body occupying the set Są. For each k, we apply the 

Figure 13.792 Newton inverse square law to obtain the force AF, 
which a particle of mass 6(P;,) AS, at Pp exerts upon 


the particle of mass m at Q. Even small figures can be helpful, and we can 
look at Figure 13.792. We find that 


(1) ap, = CMP) AS: OP 
IQP? IQP; | 


t Perhaps we should recognize the fact that the very useful standard notation is a relic 


of the good old days when it was not the fashion to recognize a difference between a set and 
the measure of the set. 
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where G is the he gravitational constant and the last factor is a unit vector in the 


direction of OP; Making a slight modification of the right member of (1) and 
adding give the right member of the formula 


6(P 
(2) = Gm 5 3 ( :) QP: AS, 
v1 [OP 
which is a Riemann sum (RS) formed for the vector function having the value 
5(P)OP 
(3) ToP 
IQP; 


at the point P of our set S. When we are in a hurry, we use the idea that the 
Riemann sum should be near the force F whenever the norm of the partition is 
small as a basis for introducing the definition 


q 
(4) F = lim Gm Y SPOOPs 4 
i [OP 
Since the limit of the Riemann sums is a Riemann integral, we write 
s TOPR 


the symbol on the right being an orthodox symbol for the integral. When we 
wish our notation to be as informative as possible, we can use n integral signs 
when S is n-dimensional. The fact that different notations are used at different 
times need not disturb us, because in any particular application we can be 
expected to know the dimensionality of the set we partition. We can, when we 
are unhurried, be more precise about the meanings of (4) and (5). The integral 
is, when it exists, the one and only vector F such that to each positive number e 
there corresponds a positive number 6 such that 


ô(P 5(P.) OP, 
F — Gm AS 
n X EER, | OP,|* : 


(6) 


whenever the Riemann sum is formed for a partition whose norm is Jess than ô. 
For some purposes, it is important to observe that (5) is an intrinsic formula 
which does not depend upon any one particular coordinate system which may 
be used to specify the positions of the points involved in the problem. For 
other purposes, particularly when problems in elementary books are being solved, 
it is necessary to introduce a coordinate system. The raison d'être of different 
coordinate systems lies in the fact that different ones are most useful in different 
situations. 


13.8 Tripleintegrals; spherical coordinates Insome cases a solid S 
(or a set S in Æ; having positive volume) and a function f defined over S 
are such that the triple integral defined by the formula 


(13.81) ff, #@ 4 = tim ) fP) as 
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can advantageously be expressed in terms of spherical coordinates r, ¢, 0. 
When we use spherical coordinates, the set S is partitioned into subsets 
Si, Se, © © © , Sn by spheres having spherical equations r = ro, r = p, 
- + + T = tm, by half-planes having spherical equations ¢ = ġo, ¢ = qy, 
- + + , $ = dm, and by half-cones having the spherical equations 0 = 6y, 
0 =b, °°: ,6= On. Figure 13.82 shows a typical subset containing 


Figure 13.82 


a point having spherical coordinates (r,ġ,0). When r > 0 and the num- 
bers Ar, Ad, A0 are all small, this subset closely approximates a rectangu- 
lar parallelepiped one dimension of which is Ar, the difference of the radii 
of two spheres. The inner (or outer) face perpendicular to the ray from 
the origin to the point (7,¢,0) closely approximates a rectangle one side 
of which has length r A@ (the length of the arc of a sector having radius 
r and central angle A@) and the perpendicular sides of which have length 
r sin 0 Ad (the length of the arc of a sector having radius r sin 6 and cen- 
tral angle Ad). Thus the area of the face is approximately r? sin 6 Ad A0. 
Thus we use the formula 


(13.83) AS = 7? sin 0 Ag AQ Ar 


(which, depending upon the choice of r, ¢, 0, is exactly or approximately 
correct) to put (13.81) in the form 


(13.84) f J Í, _J(r,4,8)r? sin 0 dẹ dê dr = lim D f(1,0,6)r? sin 0 Ad AO Ar. 


Assuming that f is bounded over S and that f is sufficiently continuous 
to make all of the integrals exist, we can use Theorem 13.63 to express 
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this triple integral as an iterated integral. When limits of integration 
for iterated integrals are to be determined, information obtained by 
looking at Figure 13.82 can be helpful. Adding subsets for which 7 
varies (¢ and 0 being fixed) yields a spike or tapered column. Adding 
spikes for which 0 varies (¢ being fixed) yields a whole or a part of a wedge 
which in some cases looks like a conventional wedge of an orange or 
lemon or cake. Adding the wedges obtained for the appropriate values 
of @ then gives the entire solid S. Articulate persons can describe results 
of performing summations and integrations in other orders. For exam- 
ple, adding subsets for which ¢ varies (r and 0 being fixed) yields all or 
a part of a circular hoop or ring, and there are two ways in which these 
hoops can be added to yield more extensive parts of S. 

Supposing that S is a spherical ball having center at the origin and 
radius R and that the density (mass per unit volume) at the point having 
spherical coordinates (7,9,6) is 6(7,$,6), we set up an integral for the polar 
moment of inertia Mi) ,2) of the ball about the z axis. For the volume 
AS of a subset of the ball, we use the formula 


(13.85) AS = 7? sin 6 Ad A9 Ar 


which, like the telephone number of a dentist, is sometimes needed but 
is usually not permanently remembered. To get the mass AM of the 
subset, we multiply by the density (mass per unit volume) 6(7,¢,6) to 
obtain 


(13.86) AM = ô(r,ġ,0)r? sin 6 Ag A0 Ar. 


Then we must be wise and strong enough to multiply this by (r sin @)?, 
the square of the distance from the subset to the z axis, to obtain the 
expression 


(13.861) 5(r,6,0)r* sin? 6 Ad A0 Ar 


for the polar moment of the subset about the z axis. Adding these and 
taking the limit of the sum gives 


(13.87) M®, a9 = | ji f. 8(r,6,0)r4 sin? 0 do dé dr. 


Since S is an entire sphere with center at the origin and radius R, 


x x R . 
(13.88) Mp, = h do f do fi 8(r,4,0)r* sin? 8 dr. 


Problems 13.89 


1 Let S be a spherical ball of radius a. Supposing that the center of the 
sphere is at the origin and that the density is ô(r,ġ,0) at the point having spherical 
coordinates (r,$,0), set up an iterated integral for the mass M. Arrange the 


710 Iterated and multiple integrals 


order of integration in such a way that the last integration is with respect tor 
and simplify the result as much as possible for the special case in which the 
density is a function of r alone, say ô(r,ġ,0) = f(r). Ans.: 


_ a T 2r 2.° = 4r a 
M = h dr [ ao f 6(7,6,8)r sin 6 dd, M h r?f(r) dr. 


2 Show how the last result of Problem 1 can be obtained by direct use of 
spherical shells and without use of iterated integrals. 

3 Solve the modification of Problem 1 in which S is a spherical shell bounded 
by concentric spheres having radii rı and ro. 

4 Let q be a nonnegative constant and let S be a spherical ball of radius a 
whose density is proportional the gth power of the distance from the center. 
Using the formula 

r? sin 0 Ad AO Ar 


for volume in spherical coordinates and taking the origin at the center of the ball, 
set up and evaluate a triple integral for the polar moment of inertia of S about 


the z axis. Ans.: The triple integral is obtained by setting ô(r,ġ,0) = kre in 


. . Srkarts 
(13.88). The required moment is 3(q +5) 


5 Show how the preceding problem gives the conclusion that the moment 
of inertia, about a diameter, of a spherical ball having radius a and uniform 
density ô is gra56. 

6 A solid spherical ball of radius a has, at each point P, density equal to 
the product of the distances from P to the origin and to the axis from which 6 
is measured. Set up and evaluate a threefold iterated integral in spherical 
coordinates for the mass M of the ball. Ans.: 


2r r a 
M h d fy sin gdo | rar gras, 


7 This problem involves lengths of curves. Suppose that, as time ¢ increases 
from a to b, a particle P moves along a curve C in such a way that its rectangular 
coordinates x, y, z% its cylindrical coordinates p, $, z% and its spherical coordinates 
r, $, ô are all functions of t having continuous derivatives. Start with the formula 


o t= [’ \(Z) +@) +) 4 


giving the length L of the curve C as an integral involving rectangular coordinates. 
Use the formulas 


(2) x = pcos @, y = p sin @¢, Z=% 


to obtain the formula 
b do 2 dp\? dz\2 
= 2o( LE LE -2 
©) L J, ve () +(3) +(%) at 
giving L as an integral involving cylindrical coordinates. Then use the formulas 


(4) p=rsin 6, $ = 9, z = yr cos 
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to obtain the formula 


b do \? d@\? dr\? 
TATTOO 
(5) a Ni sin? 6 dt tr? di + di at 
giving L as an integral involving spherical coordinates. 


8 Using ideas and formulas from the preceding problem, start with the 
formula 


(1) r= xi+ yj + zk 
for the vector running from the origin to P at time t. Show that 
(2) r = r(cos ġ sin ĝi + sin @ sinOj + cos 6k). 
Show that the velocity at time / is 

dr dô . ad 
(3) veut Guster sin ou, 
where 
(4) U; = cos @ sin ĝi + sin @ sin 6j + cos 6k 
(5) u: = cos ġ cos ĝi + sin @ cos Oj — sin Ok 
(6) u; = — sin gi + cos $j 


Prove that the vectors u;, Ue, Us, in that order, constitute a right-handed ortho- 
normal system. Show, finally, that 


PZS INE /dr\? 
2 _ —_—— — æ 
r sin? 9 (=) +7(F +(F) 


9 With the aid of hints that may be gleaned from the two preceding prob- 
lems, tell the meanings of the things in the formula 


(7) 


and give conditions under which the formula is valid. 

10 This problem provides an opportunity to fill in details and gain an under- 
standing of a method by which a great problem in cosmology is solved with the 
aid of vectors and integrals that involve rectangular and spherical coordinates. 
Figure 13.891 shows a spherical ball S of radius a having its center at the origin. 


Figure 13.891 
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It is supposed that the sphere has density 5(7,,8) at the point P having spherical 
coordinates (7,6,0). A particle of mass m is supposed to be concentrated at a 
point Æ which lies outside the ball and on the negative z axis, the rectangular 
coordinates of Æ being (0,0,—D), where D > a. We are required to determine 
and learn something about the gravitational force F upon the particle of mass m 
that is produced by the ball. We start with a basic idea of Newton that lies at 
the foundation of classical science. If particles of masses m and AM are located 
at points 4 and P, then each particle pulls the other toward it with a force of 
magnitude Gm AM/ [AP|?, where G is a universal gravitational constant whose 
numerical value depends only upon the units of force and distance that are used. 
The actual force upon the particle of mass m is obtained by multiplying this 
magnitude by AP/|AP|, the unit vector which has its tail at 4 and is pointed 
toward P. Letting AF denote this force, we have 


(1) AF = Gm AP. AM. 


|AP|* 


We need a useful formula for the vector AP. The rectangular coordinates 
x, y, z, the cylindrical coordinates p, $, z, and the spherical coordinates r, ¢, 6 
of the point P are related by the formulas 


= pcos ġ = r sin 6 cos ¢ 


y =psing = r sin 6 sin d 
z =r cos b 


which can be found in this and other books and can be derived from Figure 
13.891. Since 4 has rectangular coordinates 0, 0, — D, we find that 


(2) AP = r sin 0 cos di + r sin 0 sin dj + (D + r cos 6)k 
and hence that 

(3) [AP] = (D? + 2Dr cos 0 + r9». 

Using spherical coordinates, we employ the right member of the formula 
(4) AM = 6(r,¢,6)r? sin @ Ar Ad AO 


to approximate the mass AM of a subset (or element) of the ball containing the 
point P. Substituting in (1) gives the formula 


5(7,p,6)r? sin 0 


5 AF = Gm ——262e@ 8)? sin G 
6) ” (D? — 2Dr cos 0 + r9% 


AP Ar Ad A0 


in which the right side is an approximation to the force upon the particle of 
mass m produced by one subset of the ball. In (5) and some of the following 
formulas, ÆP is written instead of the right member of (2) to save time and paper. 
Supposing that the density function 6 is a reasonably decent function, we employ 
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partitions of the ball and principles of the integral calculus to obtain 


(6) F = lim È AF 
and 
ô(r,p,0)r? si — 
(7) F = Gm [|| etsin AP dr dọ dé, 


(D? + 2Dr cos 0 + 72) 


where the integral is a triple integral, S is the ball or the portion of E; occupied 
by the ball, and F is the total force on the particle of mass m. The formula 


F — G a 2q T sin 6 2r ——> 
®) niraj (D? + 2Dr cos 0 + 72)% a0 f,” 8(8.)AP do 


shows one of the six ways in which F can be represented as an iterate integral. 

The first phase of our work is done, and we proceed to see how (8) can be 
simplified when the density 6(7,¢,6) is independent of ¢ so that 6(7,6,0) = 
61(7,0), where ô; is a function of r and 0 only. In this case the last integral in 


(8) is 
(9) Í * Si(r,0)[r sin 6 cos gi + r sin Ø sin j + (D + 7 cos 6)k] de. 


With the aid of the fact that i sin @ dọ = 0 and ie cos ¢ dọ = 0, we see 
that this reduces to 


(10) 2rôı(r,0)(D + r cos 6)k 


and hence that (8) reduces to 


_ 3 2x §,(7,6)(D + r cos 6) sin 0 
(11) F = 21Gmk J. 72 dr J. TD FID oe 8B o. 


This shows that, when the density is independent of ¢, the components of F 
in the directions of the x and y axes areQ. Of course, wise scientists always claim 
that this must be true “‘on account of symmetry.” 

Our final step is to make additional simplification of (11) for the case in which 
the density is a function of r only, so that 5,(7,8) = 6e(r) and the ball is said to 
be radially homogeneous. One reason for interest in this case lies in the fact 
that suns and planets and moons are closely approximated by radially homo- 
geneous balls unless rapid rotations about their axes produce nontrivial equatorial 
bulges. When ô,(r,0) = 62(r), we can put (11) in the form 


(12) F = 2nGmk I rõa(r)f(r) dr 
where 

x (D +r cos 6)r sin 6 
(13) Fr) = I, (D? + 2Dr cos 0 + 1?)* dð. 


The integral in (13) may seem to be quite impenetrable until its fundamental 
weakness is discovered. If we set u = r cos 0, then (for each fixed r) du = —r 
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sin 6 dô and, except for algebraic sign, the integrand becomes that in 


r D+u 
When 0 = 0, we have u = r, and when @ = r, we have u = —r, and we see 
that (14) is correct when we see that — J i = | y We can discover that 
trading (14) for (13) was good business if we know or discover or are told that 
(14) can be demolished by the substitution 
’— D? - 7? 


v = D? + 2Du + r°, u = 


2D 
D +u = E, du = 7p 40. 
Since v = (D — r)? when u = —randv = (D + r)? when u = r, this substitution 
gives 
(15) f0) = qp fo. eae 


Since 0 < r < D, this gives f(r) = 2r/D?. Substitution in (12) then gives the 
first equality in 


Gmk pa 
(16) F = f, 4rr?ôa(r) dr = 


mM 
“or k 
As was shown in the first problem in this list, the integral in (16) is the total 
mass M of the sphere, and hence the second equality holds. The result embodied 
in (16) is the following famous theorem. Jf S is a radically homogeneous spherical 
ball, then the gravitational force which S exerts upon a particle outside S is equal to 
the force resulting from the assumption that the total mass of S 1s concentrated at the 
center of S. To help us understand the significance of this result, we should know 
some history. It is said that Newton mistrusted his whole theory of gravitational 
attraction (and therefore delayed publication of his theory for 20 years) until he 
was able to prove the theorem. 

11 IfS is a radially homogeneous spherical shell, then the gravitational force 
F which S exerts upon a particle inside S is 0. All scientists should know this 
fact and some should, when a suitable occasion comes, earn the satisfaction of 
discovering the modifications that must be made in the work of Problem 10 to 
prove the fact. 

12 With the aid of results of Problems 10 and 11, suppose that the earth is a 


homogeneous spherical ball and discuss the gravitational force upon a particle at 
the bottom of a very deep well. 


Appendix 1- 


Proofs of basic theorems 
on limits 


This appendix contains proofs of the basic theorems on limits which 
were given without proof in Section 3.2. Persons having competence 
in mathematical analysis must know these theorems and be able to give 
their proofs as thoroughly and as expertly as competent violin and piano 
players know and can play their scales. Individuals having nontrivial 
mathematical ambitions must therefore study the material of this appen- 
dix more than once. Most of the proofs depend upon the fundamental 
fact that if x and y are numbers, then 


(1) lx + yl s lel +o læ — yl s lol] + dol 
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The first basic theorem shows us that there can be at most one number L 


for which lim f(x) = L. 


Theorem A 
(2) If lim f(x) = Ly, lim f(x) = Le then L: = Li. 


Let e be a given positive number. Then e/2 is also a positive number 
which we could call e&. The first hypothesis of the theorem implies that 
there is a positive constant ô, such that 


(3) Ife) —Lil <5 


whenever x ¥ a and |x — a| < ô. The second hypothesis implies that 
there is a positive number 62 such that 


(4) I=) — Lal <5 


whenever x ¥ a and |x — a| < de. Let 6 be the lesser of ô and 5. 


Then, when x ¥ a and |x — a| < 4, the two inequalities (3) and (4) 
both hold and hence 


(3) Le — Lal = ife — Lil — Lf) — Lal 
< f(z) -Ll + Ife) -Ll <p t+ a6 


If we suppose that |L: — Li| = 0, then we could let e be the positive 
number |L: — Lı| and reach the false conclusions that |L: — Lh < 
|L: —L,| and 1 <4 and 2 < 1. Therefore, |L: — Lı| = 0 and hence 
L: = Lı. This proves Theorem A. The last part of the proof involves 


a principle that is very often used. If kis a number and if |4| < e when- 
ever ée > 0, then k = 0. 


Theorem B 
If b is a constant, then 
(6) lim b = b. 


This theorem tells us that if f(x) = b, where b is a constant, then 


(7) lim f(x) = b. 


To prove the result, let e > 0. Since f(x) — b = O for each x, we can 
let 5 = e and conclude that |f(x) — b| < e when x ¥ a and |x — al < ô. 
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Theorem C 


(8) lim x = a. 


This theorem tells us that if f(x) = x, then 


(9) lim f(x) = a. 


To prove the theorem, we observe that if e > 0 and we set ô = e, then 
|x — a| < « whenever 0 < |x — al < ô. 


Theorem D 
If c 15 a constant, then 


(10) lim cf(x) = c lim f(x) 


provided the limit on the right exists. 
To prove this theorem, let lim f(x) = L. In case c = 0, the result is 
wma 


a consequence of the fact that both sides of (10) are 0. In case c ¥ 0, 
let e > 0 and choose a positive number 6 such that 


(11) f@) - Lh <5 
when |x| = a and |x — al < ô Then 

(12) lef(~) — c lim f(x)| < e 
when x ¥ a and |x — a| < 6. This proves (10). 


Theorem E 

The formulas 

(13) lim [f(x) + g(x)] = lim f(*) + lim g(x) 

(14) lim [f(x)g(*)] = [lim f (+) ]{ lim g(x)] 
lim f(x) 

(15) lim 24) _ 22 


zag) lim g(x) 
zta 
are valid provided the limits on the right exist and, in the case of the last 
formula, lim g(x) ¥ 0. 
zta 
To prove these results, let 


(16) lim f(x) = L, lim g(x) = M. 
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Let e and e} and e be positive numbers. Choose positive numbers 
5, and ô such that 


(17) f(x) — Lh <a (0 < |x — al < 81) 
(18) lg(x) — M| < e (0 < |x — al < 6,). 


Let 8 be the lesser of 8; and 5,. Then, when 0 < |x — a| < 6, we have 


(19) IFE + g(x)] + M = IL) — L + el) ~— M 
S |f(~) — Ll + le) — M| < e + e. 


If we set e = €2 = ¢/2, then (16) and (19) give 
(20) I) + g()] — [lim f(x) + lim g(x)]| < e 
when 0 < |x — aj < ô This proves (13). 


To prove (14), we bridge the gap between f(x)g(x) and LM by sub- 
tracting and adding the term f(x)M and using (17) and (18) to obtain 


(21) Ig) — LM| = Iflg) — f(x) M] + @)M — LM] 
S | f(x)| le) — Ml + If) — LI M 
<S ((L| + eee + eM 


when 0 < |x — a| < 6. If we choose e, such that e,|M| < ¢/2 and after- 
ward choose ez such that (|Z| + e1)e2 < €/2, then (21) gives 


(22) lf(x)g(x) — LM| < e (0 < |x — al < ô). 
This proves (14). To prove (15), we begin by proving that 


1 > 
(23) im A T 


when lim g(x) = M and M #0. The more general result (15) will then 


follow from (14) and the fact that 


f) _ i l = im f(x)] | li A]. 
(24) lim Gy = lim e) ggj = Him f@] [tim ron 


To prove (23), we suppose that M = 0, that ez has been chosen such that 
0 < es < |M|/2, and that 6 has been chosen such that 


(25) lg (x) — M| < E9 

whenever 0 < |x — al < 6. Then 

(26) [M| = |g(x) — M — g(x)| S |g) — M| + |g(x)| < e + Ig(x)| 
and hence 


(27) le) > M- e> M-44 
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whenever 0 < |x — a| < ô. Therefore, 
l! iM — g(x) 


1 
g(x) M gM 


Eo _ 2e 


<7? n #2 
= (M/2)M Mv 


when 0 < |x — a| < 6. If we choose ez such that 2e2/ M? < e, we will 
have 


09) | 1 1 | 


g(x) fim g@) ~~‘ 
xa 


(28) 


whenever 0 < |x — a| < ô. This proves (15) and completes the proof 
of Theorem E. 


Theorem F 

If 

(30) lim f(x) = L 
then a 

(31) lim |f(x) — Z| = 0 


and conversely. 


The assertion (30) means that to each positive number e there corre- 
sponds a positive number 6 such that 


(32) fæ) -L| <« 


whenever 0 < |x — a| < 6. The assertion (31) means that to each 
positive number e there corresponds a positive number ô such that 


(33) | If) — L| — 0l < e 
whenever 0 < |x — a| < 8. Since the left members of (32) and (33) 


are equal, each assertion implies the other. 


Theorem G (sandwich theorem, or flyswatter theorem) 
If, for some positive number pb, 


(34) g(x) S f(x) S h(x) 
whena — p <x <aandwhena< x< a+ p, and if 
(35) lim g(x) = L, lim h(x) = L, 
then gi ™ 

(36) lim f(*) =L. 


The primitive idea behind this theorem may be phrased as follows. 
If two slices of bread (or two books) are near Minneapolis and if a slice 
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of ham (or a fly) is between them, then the thing that is caught in the 
middle must also be near Minneapolis. To prove this theorem, let 
e> 0. Choose ô such that 0 < 5 < p and the two inequalities 


(37) L—e<g(x) <Lt+e L—e<h(x)<L+e 
hold when 0 < |x — a| < 6. This and (34) give 

(38) L — e< g(x) Sfx) S A(x) Lte 

and hence 

(39) If(x) -Ll <e 


when 0 < |x — a| < ô. This proves Theorem G. 

Theorem 3.288, the last one of the basic theorems of Section 3.2, 

asserts that if p is a constant positive exponent and a 2 0, then 

lim x? = a’. 

za 
Proof of this theorem is much more difficult and is given in Section 9.2 
after the theory of exponentials and logarithms has been developed; 
see Theorem 9.271. 

We conclude this appendix with an indication of the extent to which 
mathematical fashions have changed. In a calculus textbook published 
in 1879 and cited in a footnote near the end of Chapter 3, W. E. Byerly 
says he “embodies the results of my own experience in teaching the 
calculus at Cornell and Harvard Universities.” His preface claims that 
one of the “peculiarities” of his book is “‘rigorous use of the Doctrine of 
Limits as a foundation of the subject.” His basic definition of limit 
appears on page 3. “If a variable which changes its value according to 
some law can be made to approach some fixed, constant value as nearly as 
we please, but can never become equal to it, the constant is called the limit 
of the variable under the circumstances in question.” The “‘fundamental 
proposition” in the theory of limits is given as a theorem on page 5: 


THEOREM. If two variables are so related that as they change they keep always 
equal to each other, and each approaches a limit, their limits are absolutely equal. 

For two variables so related that they are always equal form but a single 
varying value, as at any instant of their change they are by hypothesis absolutely 
the same. A single varying value cannot be made to approach at the same time 
two different constant values as nearly as we please; for, if it could, it could 
eventually be made to assume a value between the two constants; and, after 
that, in approaching one it would recede from the other. 


This appendix is based upon the premise that such “definitions” and 
“proofs” outlived their usefulness as their staunch defenders insisted 


that it is easier to learn them than to learn definitions and proofs involv- 
ing epsilons and deltas. ! 


Appendix 2 


Volumes 


This appendix involves volumes of sets in £3. Its purpose is to show 
that the theory of volumes is not simple. We shall reveal the fact that 
there is something inherently contradictory in the combination of the 
following four assumptions: 


(Aı) Each bounded set S in Æ; has a volume. 

(A2) If S, and Sz are congruent bounded sets in £3, then S, and S: 
have equal volumes. 

(As) Ifa bounded set S in E, is composed of two or three or four or five 
separate and distinct subsets, then the volume of S is the sum of 
the volumes of its subsets. 

(A4) IfS is a solid spherical shell having inner and outer radii for which 
0 < 71 < r, then the volume of S is positive. 
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Figure A.l shows three spherical shells the inner and outer radii of 
which are l and 2. A point P lies in one of these shells if its distance r 


Green shell Red shell Yellow shell 
5 subsets 3 subsets 2 subsets 


nema, Nive od 


Figure A.1 


from the center is such that 1 £r < 2. The shells are identical (or 
congruent) except that the first one is green, the second one is red, and 
the third one is yellow. It has been proved to be possible to separate 
the green shell G into five separate and distinct parts or subsets G,, Go, 
G3, G4, Gs, to separate the red shell R into three separate and distinct parts 
or subsets R,, Re, R3, and to separate the yellow shell Y into two separate 
and distinct parts or subsets Y, and Yz in such a way that 


R, M~ Gi, R: ow Go, R; ow Gs, Y, ow Ga, Y> ow Gs 


where the symbol “~” means “‘is congruent to.” If we make the first 


three assumptions listed above and use the symbol |S| to denote the 
volume of a set S, we obtain 


IG| = IG,| + IGal + IG3l + IGa| + IGs| 
= |R,| + |Ro| + [Rs| + Yl + |F 
= |R| + [Y| = |G| + |G} = 2|Gl. 


This implies that |G| = 0 and contradicts the fourth assumption (A4). 

Without undertaking to press very far into theories of volumes (these 
theories being a part of the more comprehensive theory of additive set 
functions in £3). we point out that it is possible to assign numbers (called 
volumes) to some of the sets in £3 in such a way that the following state- 
ments are true. 


(B,) Some sets in £3, including solid spherical shells having inner and 
outer radii for which 0 < r, < re, have positive volumes. 

(Bz) If S is a set in Æ; which has a volume, then each set in £3 which is 
congruent to S has a volume which is equal to the volume of S. 

(B;) If a set S in £3 is the union of a finite collection of separate and 
distinct subsets each of which has a volume, then S has a volume 
and the volume of S is the sum of the volumes of the subsets. 


The example involving the colored shells proves the following funda- 
mental fact. Whenever volumes are assigned to sets in £3 in such a way 
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that (Bı), (Bz), and (Bs) are valid, a contradiction arises from the assump- 
tion that each bounded set in Æ; has a volume. Thus there exist bounded 
sets in Æ; that do not have volumes. We are doing “rigorous mathe- 
matics” when we give a definition of volume and prove that a given 
spherical shell has a volume. We are still doing “rigorous mathematics” 
when we make and use clear statements of provable facts but postpone 
or omit the proofs. We are deep in the depths of intellectual degradation 
when, without having a definition of volume, we hold aloft a brick or 
spherical ball and convey (either explicitly or implicitly) the impression 
that the thing “obviously” has a volume. We hit the bottom when 
we say that the thing ts a volume. Unless we tolerate the idea that 
bad mathematics can be acceptable elementary calculus, we must avoid 
these degradations. Perhaps we can attain a reasonable view of this 
whole matter by recognizing the fact that modern set theory shakes the 
foundations of nineteenth-century mathematics as vigorously as modern 
atomic theory shakes the foundations of nineteenth-century physics and 
chemistry and engineering. Some of us will learn more about these 
matters than others, but we can all knowthat there is much to be learned. 


The Greek Alphabet 


Letters 
l 


Names 
alpha 
beta 
gamma 
delta 
epsilon 
zeta 
eta 
theta 


Letters 


OINMPIW> 
DBI T® wR WR 
SHoOomA~Ase KR 
yg oOo ww YS EK YR 


Names Letters Names 
iota P p rho 
kappa = oc sigma 
lambda T 7? tau 

mu T v upsilon 
nu b ¢ phi 

xi X x chi 
omicron vy y psi 

pi Q w omega 


Index 


Abel power series theorem, 629—630 


Abel value of series, 340-341 
Absolute convergence, 590-592 
Absolute (global) maxima and 
minima, 295 

Absolute value, 3, 150, 152 
Absolutely integrable, 609 
Abstract vectors, 586 
Acceleration, 179, 436—437, 544 

of gravity, 45 

radial, 437, 545 

transverse, 437, 545 
Algebra, of matrices, 94 

modern, 352 

of rational functions, 351 

of vectors, 48-110 
Algebraic equations, 448 
Algebraic functions, 448 
Algebraic graphs, 448 
Algebraic numbers, 449 
Alternating series, 610-611 
Altitudinal lines, 20-23 
Amplitude of sinusoid, 182 
Anchor ring (see Torus) 
Angles, 10, 16, 50-51, 67, 73, 439 
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Angular frequency, 39 
Antiderivatives, 203 
Any (troublesome word), 124 
Approximation (s), 123 
by differentials, 194-201, 568 
to exponentials, 488 
to integrals, 276-283 
and limits, 122-144 
to logarithms, 488 
Newton method of, 318 
to square roots, 8-9 
successive, 8-9, 318 
Arbitrary constant, 204 
Arc(s) of circle, 438-439 
Arc length (see Length) 
Archimedes, 212, 224, 234 
Archimedes integral, 349 
Archimedes property, 5, 320, 352 
Archimedes spiral, 532 
Area, 92, 229-230, 235-244, 348 
in polar coordinates, 547-550 
of surface, 425, 447-448, 550 
Aristotle, 386 
Arithmetic means, 341-342, 503- 
504 


Arithmetico-geometric mean, 343 
Astroid (see Hypocycloids, of four 
cusps) 
Asymptotes, 135 
of hyperbolas, 381 
Asymptotic cone, 407 
Attraction (see Gravitational 
attraction) 
Average rate, 176 
Average value (see Mean value) 
Average velocity, 178 
Axes, 2, 10, 59 
change of, in Ea, 107, 392-402 
in E3, 101 
rotation of, 107, 394-402 
of symmetry, 37 
translation of, 392-394 
Axioms, 9, 34, 313, 343-344 


Bachelor’s guide, 41 

Bands of Mobius, 676 

Base e (see Exponential base e) 

Beam, strongest cut from log, 301 

taken around corner, 444 

Bending upward, downward, 305 

Bernoulli functions, 231-233, 312, 
613-614, 640-651 ' 

generating function of, 650-651 

Bernoulli numbers, 232, 617, 641- 
651 

Bessel inequality, 612 

Beta integral, 228, 456, 476, 521, 
686-687 

Binary bits, 340 

Binomial coefficients, 139-140, 
647-650 

Binomial formula, 140, 634-635 

Binomial series, 634-635 

Binormal to curve, 436 

Black box, 121 

Boldface, 43, 49 

Boom-and-bust processes, 322 

Bouncing ball, 598 
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Bound(s), 212, 314-315, 323, 344, 
590 


Boundary, 293, 551 

Boundary conditions, 208 

Bounded function, 212 

Bounded variation, 352-335 

Bracket symbol, 227 

Bumblebee flights, 150, 177-179, 
409, 540 

Burington tables, 206 


Byerly calculus textbook of 1879, 
201, 720 


Calculus (slippery word), 224 
Calories (of heat), 149, 200-201 
Cams, 186 
Canonical printer’s error, 27 
Cardioid, 531, 544, 549 
Carpenter’s number system, 343 
Catenary, 435, 510 
Cauchy sequences, 598-599 
Cavalieri theorem, 283 
Center, of curvature, 429 
of gravity (see Centroids) 
of symmetry, 37 
Central conics, 368, 390-392 
Central forces, 545 
Centroids, 23, 54, 121, 261, 271, 
273, 679-680, 699 
Cesaro value of series, 342 
Chain rule, 165-166, 564-565, 
575, 578 
Change, of coordinates, 101, 107, 
349-402 
of variables, 221-223, 469-479 
Change of order, of differentiation, 
557-559 
of integration, 664, 669, 696 
Characteristic of logarithm, 485 
Circle(s), 25, 29 
of curvature, 429 
Circumcircle of triangle, 21-23 


Cissoid (of Diocles), 256, 535-536 
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Closed curve, 416 
Closed interval a S$ x S 5,5 
Commutative operations, 556 
Comparison test for convergence, 
591 
Complete orthonormal sets, 613 
Completeness of real numbers, 313 
Completion of squares, 26 
Complex exponentials, 628, 651 
Complex numbers, 139, 505-506, 
516-517, 628 
Components, of plane sets, 417 
troublesome word, 61 
of vectors, 61, 179 
Composite functions, 165 
Composite integers, 120 
Computers, 121, 280, 282, 340, 
453, 458, 468, 486, 512 
Concave, 305 
Conchoid of Nicomedes, 536 
Cones, 64, 76, 361, 369, 570 
volume of solid, 245-246 
Cones and conics, 361—407 
Confocal conics, 387 
Confocal ellipses, 378 
Conics, 361-407, 539, 551 
in polar coordinates, 539 
Conjugate hyperbolas, 382 
Connected set, 416 
Connecting rod, 191 
Conservation of energy, 187 
Conservative force fields, 657- 
658, 675 
Constants, 15 
of integration, 204 
Continuity, 144 
basic theorems on, 314-317, 326 
over closed interval, 146 
of functions of n variables, 557 
uniform, 326 
unilateral, 145 
Convergence, of sequences, 335 
of series, 336, 587 
Convex sets, 293, 551 


Indes 


Convexity of curves, 305 
Coordinates, on curve, 421 
cylindrical, 527 
on line, 2 
in plane, 10 
polar, 528-552 
rectangular, 10, 59 
in space, 59, 101 
spherical, 527 
Corn-popper function, 151-152, 
220 
Corners, 293 
Correlation, 77 
Cosine, power series for, +58 
Cosmology, 606, 711-714 
Coulomb law, 255 
Countable collection, 216, 236 
Critical values (derivative zero), 
295 
Cross products (see Vector 
products) 
Curl (of vector), 561, 673 
Curly dees, 554 
Curvature, 432-433 
in polar coordinates, 535 


Curve(s), 150, 177-179, 408-437, 
540 


closed, 416 
having finite length, 410 
having infinite length, 416 
in polar coordinates, 538-552 
simple, 417 
on spheres, 184-185 
on torus, 186 
Curve integrals, 426-428, 673-676 
independent of path, 658, 676 
Cusp, 414 
Cyclic advance of subscripts, 21 
Cycloids, 311, 399-400, 422, 435 
Cylinders, 83, 403 
Cylindrical coordinates, 527, 532- 
533, 582, 703-707 
triple integrals in, 702-707 
Cylindrical shell method, 246 


Darboux sums and integrals, 344~ 
353 
Decimal system, 2, 321-322 
Decimals “represent” numbers, 
334-339 
Decreasing function, 115, 294-313 
Dedekind axiom, 313, 344 
Dedekind partition, 313 
Definite integrals, 252 
Del (operator V), 561, 566 
Delta (ô) is distance, 6 
Delta (Ax, Ay, Az, Aw), 116 
Demand curves, 333-334 
Density 6, 260, 268, 677, 697, 706 
Density function, 260 
Dependent vectors, 58 
Derivative(s), 153 
basic formulas for, 154-158, 165, 
167 
chain rule for, 166 
change of order of, 556-559 
definition of, 153 
directional, 564, 566, 585 
of elementary functions, 166 
of exponentials, 165, 493 
four-step rule for, 154 
generalized, 323, 619 
of higher order, 170, 179, 304- 
313 


of hyperbolic functions, 506- 
509 


integrals of, 333, 659 

of integrals, 225, 234-235, 562, 
573, 658 

intermediate-value property of, 
333 

of inverse functions, 459-464 

of logarithms, 165, 493 

notations for, 153, 163-164 

partial, 553-586 

of powers, 157, 493 

as rates, 176-193 

as slopes, 168, 285 

tables of, 165, 167, end papers 
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Derivative(s), of trigonometric 
functions, 161, 165, 441-443 
of vectors, 178 
as velocities, 178 
Desargues theorem, 110 
Determinants, 87-97, 98 
Diameter(s), of parabola, 359 
of set, 668 
Difference (Ax, Ay, Az, Ace). 116 
Difference quotients, 153 
Differentiable at x [f (x) exists], 
153 
Differentiable functions, 158, 324- 
334 


Differential(s), 193-201, 568 
Differential equations, 208-210, 
445-447, 465, 496-497, 545 
partial, 559-560 
Differentiation, technique for, 158 
Direction angles, 68 
Direction cosines, 68 
Directional derivative, 564, 566, 
585 
Directrix of conic, 27, 363 
Discontinuous derivatives, 350, 561 
Discontinuous functions, 145-148, 
216, 342 
Discriminant of quadratic, 396 
Disk, 36, 241 
Displacement vector, 178 
Distance, 3, 24, 34, 61, 69 
from point to line, 28 
from point to plane, 80 
in polar coordinates, 534 
Divergence of vector function, 561 
Divergent series, 336, 587, 588, 
603-604 
Division by zero taboo, 4 
Divisors of integers, 120, 605 
Dizzy dancer function, 118, 132, 
142, 151, 215 
Domain and range, 113, 116, 121, 
459 


Dominated series, 591 
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Doodle, 66 

Dot product (see Scalar product) 
Double integrals, 667-695 
Duhamel difficulty, 420 

Dummy variables, 213 
Duplication of cube, 536 
Dwight tables, 206 


Dynamic concept of function, 112 


Earth, curvature of, £35 
orbit of, 373 
radius of, 412-413 
Eccentricity of conic, 363—366 
Economics problems, 333-334 
Eigenvalues of matrix, 96 
Eigenvectors of matrix, 96 
Elastic demand, 333-334 
Elastic rod, 256 
Electric current, 171, 181, 455 
Electromotive force, 146 
Elementary functions, 166, 277 
Elimination of parameters, 287 
Ellipses, 241, 363, 365, 369-379, 
549, 683 
Ellipsoids, 248, 403-404, 569 
Elliptic disk, area of, 241 
Elliptic integrals, 422, 636 
Elliptic reflectors, 377 
Elliptic type, 390, 396 
Empty set, 26 
Epicyclic gears, 401 
Epicycloids, 400—401 
Epsilon-delta assertions, 124, 126, 
133-135, 214, 326, 669, 677, 
685, 695, 697, 707 
Equations, systems of, 90 
Equilateral hyperbolas, 367-368 
Equipotential surfaces, 567, 569 
Eratosthenes, 413 
Error(s), rounding, 281 
(See also Approximations) 
Error functions (see Gauss prob- 
ability functions) 


Index 


Euler, Leonhard, 486, 516 
Euler constant y (gamma), 602, 
644 


Euler formulas, 506, 628, 651 
Euler gamma integral, 521, 685 
Euler integral for z!, 521, 685 
Euler line of triangle, 23 
Euler-Maclaurin formulas, 640-651 
Even function, 118 
F volute of curve, 433 
Existence of Riemann integrals, 
215-216 
Exponent(s), laws of, 481, 483 
Exponential base e, 38, 126, 281, 
493-505 
expressed as limit, 495, 504-505 
irrationality of, 596-597 
Exponential function, 38, 481-525 
calculation of, 488-489 
derivative of, 493 
elementary theory, 490-491 
graph of, 38 
series for, 231, 498-499, 505 
table of, end papers 
Exponential inequalities, 482 
Exponential series, 231, 498, 505 
Exponential (or logarithmic) spiral, 
532, 544 
Extension of partition, 345 
Extrema (see Maxima and minima) 


Factorial(s), 137, 138-140, 228, 
456, 476, 487, 521-524, 607, 
644-651, 685-687 

Factorial integral, 521, 685 

Falling bodies, 42-46, 180-181, 210 

Family of planes, 79 

Feller, William, 643 

Fibonacci sequences, 626-627 

Finite, off-beat meanings, 323 

Finite mathematics, 319, 344 

Finite numbers, 144 

Fishing expedition, 632 


“Fixed” point, 27 
Flea assertions, 125, 132 
Flexpoints, 304-311 
Fluid pressure, 274-275 
Flyspeck notation, 153 
Focal chord of parabola, 359 
Focal square of parabola, 355 
Focus of conic, 27, 363 
Folium of Descartes, 414 
Forces are vectors, 43, 177 
Four-point construction, 392 
Four-step rule, 154 
Fourier analysis, 617 
Fourier coefficients, 612 
Fourier series, 611-619 
Fourier synthesis, 617-619 
Frequency of sinusoid, 182 
Fubini theorem, 670, 686 
Function(s), 111-122 

implicit, 172, 577 

vector, 112 
Fundamental theorem, of algebra, 

517 
of arithmetic, 630 
of calculus, 225-226, 332-333 


Gamma (Euler constant y), 602, 
644 

Gamma integral, 521, 685 

Gas laws, 189, 199 

Gauss approach to factorials, 139 

Gauss probability functions, 253, 
256, 262, 309-310, 402, 525, 
643, 647-650, 693-694 

Gegenbets piel, 142 

Generalized derivatives, 323, 619 

Generating functions, 627, 650-651 

Geodesics, 412 

Geometric angle, 51 

Geometric means, 503-504 

Geometric series, 337—338, 588, 593 

Geometry of cones and conics, 


361-407 
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Global (least-square) approxima- 
tion, 618 
Global maxima and minima, 295 


Gradient (vector), 560, 566, 568, 
658 


Graphs, 35, 66, 179 
containing just one point, 293 
in polar coordinates, 530 
Gravitational attraction, 43, 74, 
149, 263-269, 423-425, 551, 
684-685, 692-693, 706-707, 
711-714 
Gravitational constants, 43, 74 
Gravitational potential, 255 
Greatest element of nonempty 
finite set, 319 
Greatest integer in x, 37, 141, 151, 
320 
Greatest lower bound (g.|.b.), 315 
Green formula, 672—676 


Guns-and-butter interpretation, 40 


Half-life, 497 

Hardy, G. H., 407 

Harmonic functions, 559-560 

Harmonic motion (see Sinusoids) 

Harmonic series, 594, 644 

Heads and tails, 522-523 

Heat equation, 560 

Heaviside function, 118, 147 

Helix, 183, 422 

Hermite polynomials, 173-174 

Holditch theorem, 437 

Homogeneous solids, 272, 697 

Hooke law, 256 

Hyperbolas, 363, 365, 379-392, 508 
equilateral, 367-368 

Hyperbolic functions, 386, 505-511 

Hyperbolic paraboloid, 406-407, 

570 


Hyperbolic spiral, 546 
Hyperbolic type, 390, 396 
Hyperboloids, 404-405 
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Hypocyclic gears, 184, £02 
Hypocycloids, 401-402 
of four cusps, 292, 402, 423, 435, 
628-683 


Identity matrix, 96 
Implicit differentiation, 172, 176 
Implicit functions, 172, 577 
basic theorem on, 583-585 
Improper integrals, 252 
Income tax rates, 187 
Increasing functions, 115, 294-313, 
322, 328 
Increments, 194, 562-563 
Indefinite integrals, 202-212, 225 
Independence of path, 658, 676 
Independent vectors, 58 
Indeterminate forms, 330-332 
Induction (see Mathematical in- 
duction) 
Inequalities, 2-6, 35, 78, 441 
elegant one, 504 
exponential, 482 
geometric mean S arithmetic 
mean, 504 
Schwarz, 78 
triangle, 33 
trigonometric, 441 
Infinite decimals, 336-339 
Infinite integrals (see Riemann- 
Cauchy integrals) 
Infinite series (see Series) 
Infinitesimals, 144, 201, 250 
Infinity, 134, 143-144, 592 
Inflection (see Flexpoints) 
Inner point, 293 
Input (element of domain), 75, 
114, 122 
Instaneous rate (see Rates) 
Integers, 5, 120, 319 
Integrability, 216, 345-353, 609 
Integral(s), 202 
Archimedes, 349 


Index 


Integral(s), change of variable in, 


221-223, 469-479 
curve (or line), 426-428, 673- 
676 
derivatives of, 225, 573 
of derivatives, 333, 659 
double, 667-695 
indefinite, 202-212, 225 
iterated, 652-667 
limits of, 350 
modified Riemann, 223 
multiple, 667-714 
in polar coordinates, 687-695 
Riemann, 214 
Riemann-Cauchy, 250-256, 657, 
667 
Riemann-Stieltjes, 261, 276, 353 
tables of, 205-206, end papers 
of trigonometric functions, 205, 
449-457 
triple, 695-714 
in cylindrical coordinates, 702- 
707 
in rectangular coordinates, 
695-702 
in spherical coordinates, 707- 
714 
of vector functions, 426 
(See also Integration) 


Integral test for convergence, 601- 


603 


Integrand, 203 


Integration, formulas for, 205, end 


papers 
limits of, 214, 470 
by partial fractions, 511—518 
by parts, 210-211, 230, 328, 473, 
477, 518-525, 633, 641, 655 
by substitution, 469 
by substitution z = tan (6/2), 
477-479 
technique for, 205-212 
termwise, 620—621, 672 


by trigonometric substitution, 


471—474 


Integration, by use of tables, 
206-207, 211-212, 465, 474, 
514 
(See also Integrals) 


Intercept form of equation of line, 
14 


Intermediate-value theorem, 316, 


International Business Machine 
Corporation (IBM), 47 
Interval(s), 5 
of convergence, 620 
Intrinsic definitions, 48, 585—586 
Intrinsic equation of conic, 363, 538 
Invariants under rotation, 395 
Inverse functions, 458-469 
basic theorem for, 459 
definition, 458-459 
hyperbolic, 507-508 
trigonometric, 461-464 
Inverse sine, series for, 635 
Inverse-square law, 74-75 
(See also Gravitational attrac- 
tion) 
Inverse tangent, series for, 467 
Involute of curve, 434 


Irrational numbers, 8, 339, 596- 
597 


Irrationality, of e, 596-597 
of square roots, 8 

Irreducible polynomial, 448 

Isometric transformer, 34 

Isothermals, 567 

Iterated integrals, 652-667 

Iterated limits, 632 


Jacobi determinants, 579 
Jordan curve theorem, 417 
Jump discontinuities, 133 


Kepler (planetary motion), 551 
Kinetic energy, 187, 257-259, 263 
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Lagrange form of remainder, 634 
Lagrange multipliers, 572-573 
Laguerre polynomials, 174 
Lamina, 270, 667, 676 
Laplace equation, 559, 562 
Laplace transforms, 686 
Laplacian V?u, 582 
in cylindrical coordinates, 582 
in rectangular coordinates, 582 
in spherical coordinates, 582 
Lattice points, 32, 152 
Law of cosines, 112, 198 
Law of mean (see Mean-value 
theorem) 
Leacock, Stephen, 124 
Least integer in set of positive 
integers, 320 
Least squares, 300 
Least upper bound (l.u.b.), 315 
Lebesgue integrals, 618, 670, 672 
Lebesgue measure, 215, 236, 244, 
283, 670, 696, 706 
Left-hand continuity, 145 
Left-hand derivative, 162 
Leibniz notation, 153, 164, 175 
Leibniz series for 1/4, 468 
Lemniscate (of Bernoulli), 389, 
532, 533, 538, 549 
Length, of circle, 135 
of circular arc, 408 
of curve, 410 
of interval, 5 
in polar coordinates, 542 
in rectangular coordinates, 417- 
423 
in spherical coordinates, 710-711 
Level surfaces, 566 
L’H6pital rules, 330-332 
Limacon of Pascal, 538 
Limit(s), 122-144 
basic theorems on, 127-128, 
715-720 
for functions of n variables, 557 
of integrals, 686 
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Limit(s), of integration, 214, 470 
of powers, 128, 495 
of Riemann sums, 214, 669, 695, 
707 
of sequences, 335, 351 
unilateral, 133-134 
Line integrals (see Curve integrals) 
Linear equations, 15 
Linear transformer, 75 
Lines, 11, 70 
Lines of support, 285, 293-294 
Linkage, 186 
Lituus, 546 
Local maxima and minima, 295, 
305 
Logarithm(s), 168, 484-501 
calculation of, 488, 500 
Logarithmic derivative, 199 
Logarithmic function, 38 
derivative of, 165, 493 
graph of, 38 
table of, end papers 
theory of, 484-501 
Logarithmic series, 500 
Lower bound, 212 
Lower integral (Darboux), 346 
Lower sum (Darboux), 344 


Loxodromes (rhumb curves), 185 


Mach number, 388 
Machin formula for x/4, 468 
Maclaurin, 546, 621 
Marginal cost and revenue, 334 
Mass functions, 259-261 
Mathematical induction, 320 
Mathematics, queen of sciences, 
524 
Matrices, 87 
algebra of, 94-97 
multiplication of, 95 
Matrix transformations, 96 
Maxima and minima, 294-312 
existence of, 315, 333 


Index 


Maxima and minima, for functions 
of n variables, 571-573, 638- 
639 

Mean value of function, 261 

Mean-value theorem, 324 

extended, 328, 332 
generalized, 329 

Measure (Lebesgue), 215, 236, 
244, 283, 670, 696, 706 

Medians of triangle, 19, 24, 54 

Merry-go-round problem, 193 

Mid-point formulas, 5, 13, 53 

Mid-triangle, 22 

Minimum (see Maxima and 
minima) 

Minor of determinant, 87 

Mobius bands, 676 

Moment(s), 120, 260-262, 269- 
275, 423, 677, 690, 697-700 

Moments of inertia (see Moments) 

Monotone functions, 294, 347, 410 

Monotone sequences, 337 

Motion, workaday definitions, 179- 
180 


Multiple integrals (see Double in- 
tegrals; Iterated integrals; 
Triple integrals) 

Multiple point, 417 

Multiplication of series, 623-627 

Multiplicities of zeros, 517 

Museum of SPC, 102 


Napier tables, 485-486 

Nappe of cone, 361 

Natural logarithms (base e) (see 
Logarithms) 

Newton approximation to zeros, 


Newton law of gravitation, 74, 
254, 551, 711-714 
(See also Gravitational attrac- 
tion) 
Newton notation, 153, 164 


Nine-point circle, 23 

Nondifferentiable functions, 162 

Nonintegrable function, 215 

Nonmeasurable sets, 237, 721-723 

Norm |P| of partition, 212, 668 

Normal lines, 17, 78, 290 

Normal to surface, 567 

Normal vectors, 52, 78 

Null set, 223 

Number systems, 143-144 

Numbering systems, 3, 47 

Numerical integration (see Simpson 
approximation to integral) 

Numerical tables, end papers 


Octant, 60 
Odd function, 118 
One-sided (see Unilateral limits) 
Open intervala < x < 5,5 
Operators, 96, 112, 114 
commutative, 556 
Orbit, of earth, 373 
of planets, 551 
Order, of differentiation, 556-559 
of integration, 664, 669, 696 
Ordered pairs, 113 
Orientation of simplex, 97 
Origin, 2, 10 
Orthocenter, 20-23 
Orthogonal vectors, 52, 60 
Orthonormal sets of functions, 456, 
611-613, 616-619 
Orthonormal vectors, 60, 74 
Osculating circle (see Circle, of 
curvature) 
Output (element of range), 75, 114, 
122 
Ovals of Cassini, 538 


Pappus, trisection of angles, 389 
Pappus theorem, 247 
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Parabolas, 27, 31, 40, 240, 354-361, 
363, 399 
Parabolic reflectors, 358 
Parabolic type, 390, 396 
Paraboloids, 62, 247, 406, 425 
Parallel axis theorem, 681 
Parallel lines, 16 
Parallelogram law, 50 
Parameters, 66, 72, 179, 574 
Parametric equations, 66, 72, 179 
Partial derivatives, 553-586 
Partial differential equations, 559- 
560 
Partial fractions, 511-518 
Partial sums of series, 335, 587 
Particle concentrated at point, 75 
Particle sliding on curve, 191, 425 
Partition, Dedekind, 313 
of interval, 212, 344, 410, 706 
of sets in Es, 668, 706 
of sets in Es, 695, 706 
Partition number, 313 
Pascal triangle (binomial coeff- 
cient), 140 
Pendulum problem, 198-199 
Percentage of error, 197 
Periodic functions, 613 
Perpendicular lines, 16, 76 
Perspective triangles, 110 
Perversity of inanimate matter, 
229 
Phase angle of sinusoid, 39 
Pi (x), decimal approximations, 2, 


Picture, angles subtended by, 465- 
466 

Piecewise continuous, 348 

Piecewise monotone, 347 

Plane(s), 78-87 

Plane geometry puzzlers, 110 

Planimeter, 277 

Point-direction equations of lines in 
E;, 70 


Point-slope equation of line, 12 


734 


Polar coordinates, 528-552, 687- 
695 
graphs in, 530 
integrals in, 687-695 
Polar moment of inertia, 681 
Polar points and lines, 390-392 
Polynomials, 115, 148, 448, 517 
Hermite, 173-174 
Laguerre, 174 
Tchebycheff, 465 
Potential, 255, 425 
Potential functions, 568, 658 
Power series, 593, 619-640 
Pressure (fluid), 274-275 
Primary and secondary variables, 
576 
Prime numbers, 120, 143 
series of reciprocals of, 630-631 
Prismoidal formula, 283 
Probability, 58, 300, 522-523, 596, 
643, 647-650, 693-694 
Feller book, 643 
Profits, maximizing, 334 
Projectiles, motion of, 183, 199, 210 
spinning, 402 
Projections, 59 


Quadrants, 11 

Quadratic forms, 393, 572 
Quadratic formula, 160 
Quadric cones, 407 
Quadric sections, 403 


Quadric surfaces, 102, 108, 402- 
407, 570 


Quartic equations, 537 
Queer sets, 283, 721-723 


Quotients, shilling and built-up, 
26-27 


Rademacher functions, 596 
Radial acceleration, 437, 545 


Radian (measure of angles), 50, 
439 
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Radical axis, 32 
Radius, of convergence, 620 
of curvature, 428—435 
Random-walk problem, 58 
Range and domain, 113, 116, 121, 
459 
Range of projectile, 183, 199 
Rare books, 485—486 
Rates, 176-193 
Ratio test, 599-601 
Rational functions, 116, 149, 351 
Rational numbers, 5, 338, 343 
Real numbers, 1 
Rearrangements of series, 597—598 
Rectangles, areas of, 235 
Rectifiable curve, 410 
Rectified sinusoids, 617 
Reduction formulas, 514, 520, 
522 
Regular polygons and circles, 143 
Related rates, 188-193 
Relative (local) maxima and min- 
ima, 295 
Relative motion, 192-193 
Repeated integrals (see Iterated 
integrals) 
Repeating decimals, 338 
Resistors, 200 
Restrictions of functions, 459 
Resultant, 50 
Riemann integrals, 214-250, 652- 
723 
as areas, 348 
basic properties of, 224-227 
existence of, 215-216, 345-353 
Riemann sums, 212, 668, 695, 707 
compared with Darboux sums, 
345—349 
Riemann theory of trigonometric 
series, 618-619 
Riemann zeta function, 603, 607, 
614, 617, 630 
Riemann-Cauchy integrals, 250- 
256, 608—609, 657, 667 


Riemann-Stieltjes integrals, 261, 
276, 353 

Right-hand continuity, 145 

Right-hand derivative, 162 

Right-hand rule, 52 

Right-handed system, 60, 66, 73,99, 


Rolle theorem, 324 

Rollers, triangular, 437 

Root test for series, 629 

Rose of four leaves, 531 

Rosser, John Barkley, 122 

Rotation of axes, 107, 394-402 

Rounding errors, 281 

Ruler and compass constructions, 
389, 536-538, 546 


Rules of our game, 1 


Saddle surfaces, 407 

Sandwich theorem, 128, 719-720 

Saw-tooth function, 37, 232, 613, 
640 


Scalar(s), 43, 49, 177 
Scalar acceleration, 44, 180 
Scalar (dot) product, 51, 67-77 
Scalar triple product, 105 
Scalar velocity, 43, 180 
Schlicht functions, 459 
Schwarz inequality, 78 
Schwarz paradox, 448 
Secant, series for, 623, 624 
Second derivatives, geometric use 
of, 304-312 
Sector, area of, 440, 476 
Semicubical parabola, 413-414 
Senatorial secretaries, 300 
Sequences, 335, 587, 598-599 
Series, 334-342, 587-651 
Abel value of, 340 
absolutely convergent, 590-592 
alternating, 610-611 
binomial, 634-635 
Cesaro value of, 342 
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Series, comparison test, 59] 
convergence value of, 336, 341, 
587 
converging to, Bernoulli func- 
tions, 614 
cosine, 458 
exponential, 231, 498-499, 505 
inverse tangent, 467 
log (1 + x)—}, 500 
1/4, 468, 500 
sine, 458 
tangent, 623, 651 
definition of, 335, 587 
divergent, 336, 587, 588, 603-604 
dominated, 591 
exponential, 231, 498-499, 505 
Fourier, 611-619 
geometric, 337—338, 588, 593 
harmonic, 594, 644 
integral test, 601—603 
multiplication of, 623-627 
partial sums of, 335, 587 
power, 593, 619-640 
ratio test, 599-601 
rearrangements of, 597-598 
root test, 629 
Dk? = r?/6, 594, 605, 614, 672 
Taylor, 621, 633-636 
terms of, 335, 588 
termwise addition, 589 
termwise differentiation, 620, 632 
termwise integration, 620—621, 
672 
trigonometric, 618-619 
uniform convergence, 342 
of vectors, 597 
Serpentine, 259 
Set, bounded, 293 
Shell game, 721-723 
Signum function, 188, 132, 211, 244 
Similarity of parabolas, 360 
Simple closed curve, 417 
Simple harmonic motion (see Sinus- 


oids) 
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Simple points, 585 
Simplex, n-dimensional, 58 
Simpson approximation to integral, 


278-282 


Sine(s), decimal approximations to, 


power series for, 458 
Sinusoids, 39, 182, 184, 442, 445, 
455 
rectified, 617 
Skew lines, line normal to two, 77 
transversals of three, 108 
Slab method, 245 
Slide rules, 489-490 
Slopes, 11, 168, 285, 305 
increasing and decreasing, 305 
Snell formula (optics), 302 
Social security numbers, 113 
Solid having maximum gravita- 
tional attraction, 693 
Space (£3), 48-110 
Span of vectors, 57 
SPC, 102 
Specific heat, 200-201 
Speed, 177, 179, 436-437, 442 
Speidell, John, 486 
Sphere, 62, 550, 573-574 
in conical glass, 303 
Spherical ball, volume of, 246-247 
Spherical coordinates, 527, 533, 582, 
707-714 
triple integrals in, 707-714 
Spheroids, 249, 404 
Spirals, of Archimedes, 532 
exponential, 532, 544 
hyperbolic, 546 
Square roots, 4, 8 
table of, end papers 
Square-wave function, 616-617 
Standard deviation c, 309, 694 
Star characters (intermediate 
points), 212-213 
Statements, 35 
Static concept of function, 142 
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Step functions, 217-219 
Stirling formulas, 487-488, 608, 
644-647 
Stokes formula, 674 
Stream curves, 387 
String property of ellipse, 370 
String unwound from spool, 434 
Strophoid, 546 
Subintervals, 212 
Successive approximation, 9, 318 
Successive elimination, 19 
Sum (convergence value) of series, 
336, 341, 587—588 
Summation symbol Z, 213 
Sums of powers, 233, 331, 643, 644 
Surface, 425, 574 
area of, 425, 447-448, 550 
normals to, 567 
tangent planes of, 567 
triangulation of, 447 
Symbols and numbers, 244 
Symmetry, 37 
of conics, 365 
Systems of equations, 90 


Tables, Burington, 206 
of derivatives, 165, 167, end 
papers 
Dwight, 206 
of integrals, 205-206, end papers 
numerical, end papers 
use of, 206-207, 211-212, 465, 
474, 514 
Tangent lines, 168, 179, 284-294 
Tangent planes, 567 
Tangent (trigonometric), series for, 


623, 651 
Tangent vectors, 179, 286 
in cylindrical coordinates, 540 
in polar coordinates, 541 
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